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PROHORMONE CONVERTASE TRANSFORMED CELLS 



B^VBronnd of the Invention 

5 

plflrt of the Invention 

This invention relates to host cells expressing one or 
more prohormone convertase enzymes and the production of a 
biologically active polypeptide from these cells. The 
10 invention further relates to polypeptide precursor variants 
having endoprotease cleavage sites which are processed by 

the host cell. 
Deaeription of Related — 

Most, if not all proteinaceous hormones are 

15 synthesized as relatively large precursor molecules, 
prohormones, that are biologically inactive (reviewed in 
Docherty and Steiner 1982; Loh ec al . 1984; Mains ec al. 
1990) . Maturation of the prohormone to its active form 
often requires endoproteolytic cleavage at paired or 

20 multiple basic amino acid residues tc liberate the active 
component from the inactive portion of the precursor 
molecule. Until recently, almost nothing was known 
concerning the identity of the proteins responsible for 
this important stage in processing: the prohormone 

25 convertase (PC) enzymes. 

Not every kind of cell has the capacity to correctly 
convert a prohormone to its active mature form through 
. these specific cleavages. For some classes of prohormone 
this processing is apparently limited to those cells that 

30 contain both constitutive and regulated pathways of protein 
secretion (Gumbiner and Kelly 1982). Cells having both 
constitutive and regulated pathways of protein secretion 
are located almost exclusively in the specialized hormone- 
producing tissues of the endocrine and neuroendocrine 

35 systems . 

An example of a family of hormones that is processed 
during regulated secretion is the insulin family of 
hormones. This family includes insulin, the insulin-like 
growth factors IGF-I and IGF- II. and relaxin. In their 
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to yield active hormone, 

When heterologously expressed in cells having only a 
constitutive pathway of protein secretion, most hormone 
precursors, such as human preproinsulin, are secreted in an 
unprocessed prohormone form (Gumbiner and Kelly 1982). 
Experimental manipulation of mouse AtT-20 cells that 
disrupted the regulated secretory pathway of those cells 
has been observed to redirect the polyhormone precursor 
proopiomelanocortin (POMC) into the constitutive secretory 
• pathway. In that case, POMC was no longer subjected to 
processing and was found to be secreted from the cell as 
the intact precursor (Moore et al. 1983). These 
is observations suggest that there is a class of processing 
enzymes that function only in the regulated pathway of 
protein secretion; this pathway is apparently limited to 
certain highly-specialized cell types. 

The POMC protein is a prohormone that is subject to 
differential processing. Expression of mature POMC 
derivatives is highly tissue-specific; alternate processed 
forms of the same prohormone precursor are produced in 
different regions of the brain (Douglass ec al. 1984). The 
enzyme (s) and control mechanisms involved in the generation 
of this diversity are unknown. The possibility exists that 
there are tissue-specific enzymes that recognize unique 
amino acid sites on the prohormone substrate, or 
alternatively, thac only one enzyme is responsible for the 
endoproteolytic cleavages and is itself under tight 
metabolic control, with each tissue providing a 
characteristic intracellular environment that is associated 
with cleavage at a specific subset of residue pairs. 

Until recently, the only known eukaryotic prohormone 
processing enzyme was the KEX2 gene product of the yeast 
Saccharomyces cerevisiae (Julius et al. 1983 and 1984? 
Fuller et al. 1989a). The kex2 protein is a serine 
protease related to the subtilisin family of enzymes and 
has a preference for specific pairs of basic amino acids on 
its native hormone precursor substrates (pro-a-f actor 
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mating type pheromone and the pro-killer toxin) . Kex2 shows 
maximum enzymatic activity at neutral pH with a strict 
requirement for the presence of calcium (Julius et al. 
1984; Fuller et al. 1989a)- It is membrane-bound and the 
5 mature, active form of the enzyme is localized in the post- 
Golgi compartment of the yeast cell (Fuller et al. 1989b; 
Redding et al. 1991). It can effectively serve as a 
substitute convertase for bona fide mammalian PC enzymes 
when heterologously expressed in otherwise processing- 

10 deficient cells by its demonstrated ability to correctly 
process certain mammalian prohormones: Nerve growth factor, 
bNGF, in BSC-40 cells (Bresnahan ec al. 1990); protein C in 
baby hamster kidney BHK cells (Foster ec al. 1991); POMC 
both in BSC-40 cells (Thomas et al. 1988) and in COS-1 

15 cells (Zollinger ec al. 1990)]. Kex2 was shown to have a 
highly similar if not identical substrate specificity to 
■ the authentic human proalbumin convertase in vitro 
(Bathurst et al. 1986; Brennan ec al. 1990). When 
heterologously expressed in mammalian cells kex2 will home 

20 to the post-Golgi compartment, where it is apparently fully 
active (Germain ec al. 1990). These observations have led 
to speculation that this yeast protein must be both 
functionally and structurally similar to an authentic 
mammalian convertase. A search began for the elusive 

25 mammalian counterparts of kex2 based upon structural 
homologies . 

KEX2 and Fur Hydrophobic Anchor 

The Kex2 endoprotease has two hydrophobic regions 
located at the N-terminal side and C-terminal side. The C- 

30 terminal hydrophobic transmembrane anchor is responsible 
for the anchoring of the Kex2 to a Golgi body of a yeast 
cell. Deletion of this C- terminal hydrophobic anchor 
renders the Kex2 endoprotease soluble* while still 
maintaining substrate specificity (EPO PUB No. 0327377). 

35 An inspection of genetic data bases identified a 

potential mammalian homologue of kex2 that shared many 
features of the active site domain of the kex2 protein: the 
fur gene product of human liver, furin (Fuller et al. 
1989b) . Furin was subsequently cloned and successfully 
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expressed in processing-deficient cells: cotransfection of 

. „ , , j * i n ros-l cells (van 

den ec"^ 

African green monkey kidney epithelial BSC-40 cells 
5 (Bresnahan ec aJ. 1990) resulted in correct processing of 
the precursor substrates. 

However, when furin and a prohormone, prorenin, have 
been coexpressed in mammalian cells, no processing has been 
observed (Hatsuzawa, ec al The . TfTiinffil of Fimtw i ca l 
10 Rt-.TV Vol 265 [1990] ) . 

Furin also shared with kex2 a requirement for calcium 
ions, displayed maximum activity at a neutral pH, and, like 
kex2, was shown to be a membrane-bound protein in the post- 
Golgi compartment (Bresnahan et al. 1990). However, 
because furin does not seem to be capable of efficiently 
processing certain hormone precursors such as prorenin, and 
its mRNA message is apparently expressed in most if not all 
mammalian cells (Hatsuzawa et al. 1990). the furin protease 
may play a role in an essential -housekeeping- function in 
that it could be responsible for many of the basic amino 
acid site cleavages occurring in the constitutive secretory 
pathway of the cell. These functions could be general, .or 
more confined to specific cell-types with the constitutive 
pathway-dependent processing of. growth factors like bNGP. 

Because furin does not appear to be directly involved 
in the endoproteolytic processing of prohormones in 
endocrine-like tissues, there must be other mammalian 
proteins both functionally and structurally similar to 
furin and yeast kex2 that serve as the authentic prohormone 
converses: PC proteins that share distinctive homologies 
in their active sites. This search for structural homology 
has recently lead to the discovery of more PC proteins. 

By using the technique of -Mixed oligonucleotides 
Primed Amplification of cDNA" ■ (MOPAC, Innis 1990), Smeekens 
and Steiner (1990) were able to use the conservation of 
amino acid sequence surrounding the active sites of both 
bacterial subtilisin and the yeast kex2 protease to amplify 
a putative prohormone convertase CDNA from human 
insulinoma. This PC cDNA, termed mPC2, showed an 
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exceptional degree of homology co the yeast kex2 protease. 
In similar experiments Seidah and colleagues identified 
another member of the subtilisin family of proteases, 
termed, mPCI (Seidah, et al 1991). 

The distribution of PCI and PC 2 has so far been 
observed to be confined to neuroendocrine-derived tissues 
(Seidah et al. 1990; Seidah et al. 1991; Smeekens et al. 
1991), suggesting that these proteins may be candidates for 
the authentic convertases resident in the regulated 
secretory pathways of these tissues. The substrate 
specificities of PCI and PC 2 for defined pairs of dibasic 
amino acids at the prohormone cleavage sites do not appear 
to be identical (Benjannet et al. 1991; Thomas et al. 
1991), nor do they share the same pattern of tissue 
distribution in the brain (Seidah et al. 1991). This 
implies that different classes of prohormones may require 
unique PC enzymes, and that both PCI and PC 2 may be members 
of a family of specific processing enzymes employed by the 
endocrine system to generate the diversity of hormones 
20 required throughout the entire organism. 
Precursor processing 

The biosynthetic process begins with the synthesis of 
the precursor (prepropeptide) on the rough endoplasmic 
reticulum (RER) . The signal peptide (pre-portion) is 
25 clipped off as the proprotein is transported into the 
cisternae of the RER where protein folding, di-sulfide bond 
formation and asparagine-linked glycosylation occur (Figure 
1) . The precursor is then translocated to the Golgi 
apparatus where more complex glycosylation and 
30 phosphorylation occur. Within the Golgi of some cells, 
proteins are sorted by an unknown mechanism into two 
groups: those that will be constitutively secreted and 
• those that undergo regulated secretion. The constitutively 
secreted proteins will enter vesicles and be transported to 
35 their target continually without the need for any specific 
stimulus. Proteins undergoing regulated secretion are 
transported to secretory vesicles and will be released* only 
when an adequate stimulus is provided. Within the 
secretory vesicles the excision of bioactive peptides from 
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the larger inactive protein precursors occurs. Two steps 
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usually at the carboxyl side of paired basic amino acids 
(e.g. lys-arg, arg-arg) and exoproteolytic cleavage of 
5 flanking basic amino acids by carboxyl- and/or 
aminopeptidases {reviewed in Mains et al. 1990.). 

Many proteins, including NGF (Berger and shooter 
1977), are first synthesized as larger precursor proteins. 
The function of the precursor is still not well understood. 
10 One possible role of the precursor is to aid in protein 
folding (Steiner 1982, Selby et al. 1987. Wise et al. 
1988). The precursor may also direct the protein to the 
proper location or pathway within the cell as is suggested 
by Sevarino and colleagues (1989). though this is not the 
is case with the connecting peptide of insulin (Powell et al. 
1988, Gross et al. 1990). Additionally, the precursor has 
been shown to have roles in gamma carboxylation of glutamic 
acid residues (Pan and Price 1985, Furie and Furie 1988), 
and regulation of the coordinate synthesis of multiple 
20 mature peptides from a single precursor polypeptide eg., 
POMC. (For review, see Douglass et al. 1984). 
Relaxin 

In the present invention, prorelaxin is used as a 
typical hormone precursor. . The relaxin is first 

25 synthesized as a preprohormone precursor which undergoes 
specific processing to form the mature two-chain, 
disulfide-linked active relaxin. A major part of this 
processing requires endoproteolytic cleavage at specific 
pairs of basic amino acid residues. This specific 

30 processing does not occur when the precursor is 
heterologously expressed in cells containing only the 
constitutive pathway of protein secretion. Mature human 
relaxin 'is an ovarian hormonal peptide of ' approximately 
6000 daltons in molecular weight known to be responsible 

35 for remodeling the reproductive tract before parturition, 
thus facilitating the birth process. Hisaw. F.L., STOSU 
- nr rr ««m MPd.. 23:661-663 (1926); Schwabe. C. et 
al., jj^- «<nnh™ COM.. 2i: 503-570 (1977); 

• James, R. et al.. HBWft, 2£1: 544-546 (1977). .This 
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protein appears co modulate the restructuring of connective 
tissues in target organs to obtain the required changes in 
organ structure during pregnancy and parturition. Some of 
the important roles for relaxin as a pregnancy hormone 
5 include inhibition of premature labor and cervical ripening 
at parturition. While predominantly a hormone of 
pregnancy, relaxin has also been detected in the non- 
pregnant female as well as in the male. Bryant -Greenwood, 

G.D., RnflnrririP «»vi«wa. 3: 62-90 (1982) and Weiss, G. , 
10 Ann. Rev. Physiol. . A&: 43-52 (1984). 

Relaxin consists of two polypeptide chains, referred 
to as A and B. joined by disulfide bonds with an intra- 
chain disulfide loop in the A-chain in a manner analogous 
co that of insulin. Two human genes (HI and H2) for human 
IS relaxin have been identified, and only H2 is expressed in 
the ovary. Porcine relaxin. the sequence of which has also 
been determined, has been used in human clinical trials for 
ripening of the cervix and induction of labor. MacLennan et 
al., nhsr.Pr.rics f- ny^raloa\'. ££= 598 (1986). 
20 European Pat. Publ. No. 86,649 published Aug. 24, 1983 

discloses how co prepare porcine preprorelaxin, porcine 
prorelaxin. and porcine relaxin. Australian Pat. No. 
561,670 issued Aug. 26. 1987, European Pat. Publ. No. 
68,375 published January 5. 1983, and Haley et al.. DNA, 1: 
25 155-162 (1982) disclose how to prepare porcine relaxin. 
European Pat. Publ. Nos. 101.309 published Feb. 22, 1984 
and 112.149 published June 27. 1984 respectively disclose 
the molecular cloning and characterization of a gene 
sequence coding for human relaxin and human H2 -relaxin and 
30 analogs thereof. U.S. Pat. No. 4.267.101 issued May 12. 
1981 discloses a process for obtaining human relaxin from 
fetal membranes. 
Nerve Growth Factor 

Nerve growth factor (NGF), required for sympathetic 
35 and sensory neuron survival (Levi-Montalcini and Booker 
1960, Gorin and Johnson 1979). is the most well 
characterized neurotrophic factor in part due to* the 
exceptionally high levels synthesized in the male mouse 
submaxillary gland. It was from this source that NGF was 
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purified (Cohen 1960) and its complete amino acid sequence 

gene by screening a mouse submaxillary gland cDNA library 
5 using a probe constructed for a hexapeptide based on the 
least degenerate codons contained within the NGF sequence. 
Molecular cloning, using the polymerase chain reaction 
(PGR) (Saiki et .1.1985. Mullis et al. 1986). has recently 
revealed that NGF is a member of a family of neurotrophic 
10 factors that also includes BDNF (Leibrock et al. 1989) and 
MT -3 (Hohn et al. 1990. Rosenthal et al. 1990, Maisonpierre 
et al. 1990a. Ernfors et al. 1990. Jones and Reichardt 

19901 • as 
NGF, BDNF and NT-3 are all translated as 

15 preproproteins that require endoproteolytic cleavage in 
order to be active. There is an extensive amount of 
homology 050%) between the mature portions of NGF ^ BDNF , 
and NT3. However the homologies of the precursor portions 
are much lower (-20%) . The efficiency of * 
20 three neurotrophic factors, from their inactive 

the active factor, vary substantially with different cell 
types. 

IMU 'lu„ is . poiypepUae.hor.one which is proceed in 
« the beta cells of the islets of Langerhans situated in the 
Screes of all vertebrates, xnsulin is -creted d,«ct y 
Lo the bloodstream where it regulates 
metabolism, influences the synthesis of protein «-of«». 
^ the formation and storage of neutral lipids. OtJUk 
,. Tnd^lOth edition, 1983. insulin promotes anabolic 
P^es and inhibits catabolic ones in muscle, liver and 
adipose tissue. The structure of human, insulin, was 
dLclosea in Hature 187.483 .I960,. «or to the discovery 
^recc^inant n technology, the major source of insula 
1S for human consumption was the pancreases of slaughtered 
animals. Human insulin was among the first ^ I 
health care products produced by recombinant 
review of the research, development, and recombinant 
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production of human insulin is in Science 2JJL — 612-637 

In addition to its role in vivo, insulin is also 
useful in recombinant cell culture. Insulin is an example 
5 of a polypeptide factor important for mammalian cell culture 
proliferation and anabolism. Some cell cultures produce 
endogenous insulin and some do not; cell cultures which rely 
on added insulin are problematic because insulin is unstable 
in some cultures. European publication number, 0307247A2, 

10 published March 3, 1989, describes the introduction of 
nucleic acid encoding insulin into a mammalian host cell to 
eliminate the need for adding exogenous insulin. 

Insulin is synthesized as a larger precursor protein, 
proinsulin. Proinsulin is a single polypeptide chain 

15 containing a sequence of about thirty residues that is 
absent from mature insulin. Proinsulin, like prorelaxin, 
has a B-C-A chain structure. The C or connecting peptide 
joins the carboxyl end of the B chain and the amino terminus 
of the A chain of the future insulin molecule piocfremistrv 

20 3rd edition, pg . 995 (1986). The mature insulin is 
generated by cleavage of the C peptide at dibasic residues 
Arg(3D- Arg(32) and Lys ( 64 ) -Arg ( 65 ) . Two distinct 
processing enzymes have been defined which are specific for 
th&ir respective dibasic cleavage sites in proinsulin; type 

25 I is substrate specific for the BC junction, while type II 
is specific for the CA junction (Weiss, Biochemistry 29, 
1990) . 

Naturally occurring mutations in the human insulin 
gene have been reported by Steiner et al. (Pififrfttes Care vol. 

3b 13, no. 6 pg. 600-609 [1990)). Members of a family with 
hyperproinsulinemia have a substitution of insulin B chain 
•* residue 10, a histidine, with aspartic acid resulting in a 
proinsulin that is reported to exhibit altered subcellular 
sorting behavior. In patients having hyperproinsulinemia, a 

35 significant proportion of the newly synthesized Asp-10 
proinsulin is secreted from the islets in an unprocessed 
form via an unregulated or constitutive protein secretory 
pathway (Steiner et al- pnas USA, vol. 85, pg. 8943-8947 
[1988]) and {Quinn et al. The J. Cell Bio, vol. 113, pg. 
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,87-996 119911). others have shown that insulin containing 
this BIO Zo station results in a more active formof 
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mr et aT^S-HSA vol. 84, pg. 6408-641X 
(Schwartz, et al. 2 [1988); shoelson 

Brange et al. UiU^ vol. 33 pg. 6 5 

et ai. vol. aiafito^vol. 31 P g. 1757 - i ; 67 J 5 1 l 9 g 92 5 ; ) 5 992 „ 
et al . (£m **L-El^^ -I. 5 no. 6 pg 519-525 [199 1 
report that the replacement of B-chaxn res d£ 1 < a 
histidine. with aspartic acid increased the • t * llXty ifc 
insulin. Kild-typ« human insulin 
interactions and forms dimers; dimer.c xnsulxn bxnd^to Z 
to form hexamers. The histidine residue at pos^on BIO xn 
insulin is involved in ^ ^ 

been reported by Ouinn et al. Supra, that tn 
mucaut. B!0 histidine to aspartic acid, allows formats 
aimers of human proinsulin but not 
maulin Like Growth Factor I ana II * 

xnsulin-me growth factors, or XO.S. » 

o „ f 19871) The complete amino acid sequences of IGF I 

a rrr tt have been determined (Rinderknecht, et ai 
and IGF-II have been a Rinderknecht. 

» - are *th single 

chain poises r e ^ 

seouence identity of 49 ana ^ f c 

h In insulin A and B chains. The connecting ^ ~ C 
„ region is considerably shorter than the one of proinsuUn 

LLcule » addition. IGF-I contains a .short. 8 amino 

r carbowter.inal extension peptide, termed the D 
rl ^r Thich - bomologous region exists in insulin 

-"-E^ TxU are a^-*^* 

proteolytic processing (Jansen et al m*X&-" / 
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[1984];Bell, et al Nature 310. 775 [ 1984 ) Hansen et al_£E£S 
letters 179. 243 [1985]. 

There exists a need for a method of producing 
polypeptides from their polypeptide precursors in cell 
5 culture. It is therefore an object of the present invention 
to provide host cells that express active prohormone 
convert ases that cleave polypeptide precursors to 
polypeptides. It is also an object of the present invention 
to provide a method for the production of a desired 

10 polypeptide in cell culture in a manner that results in the 
proper processing and glycosylation of the desired 
polypeptide. A related objective of the present invention 
is the production of polypeptide hormones, such as insulin, 
relaxin, and IGF-I. 

15 A further object of the present invention relates to 

providing polypeptide precursor mutants having prohormone 
convertase cleavage sites for processing by the host cell. 

There also exists a need to eliminate problems 
associated with supplying necessary polypeptide factors 

20 (e.g. insulin or transferrin) for the maintenance and 
growth of recombinant host cells. This need is 
particularly evident in the use of mammalian cell culture 
in the production of commercial polypeptides such as 
pharmaceutical products. It is therefore an object of the 

25 present invention to provide mammalian cell cultures that 
express prohormone convertases that enable processing of 
polypeptide factor precursors to polypeptide factors needed 
for mammalian cell culture proliferation and anabolism. It 
■ is an object of the present invention to produce 

30 polypeptide factors using host cells expressing prohormone 
convertases . 

; These and other objects of the invention will be 

apparent to the ordinary artisan upon consideration of 
the specification as a whole. 
35 Summary of the Invention 

The present invention describes a method for the 
production of a desired polypeptide in host cells 
expressing a prohormone convertase. In one embodiment the 
desired polypeptide is a prohormone processed to active 



WO 93/MM7 

. m another entailment; the desired polypeptide is 

s X: ceu enzyme, such as a prohormone convert**. Jn 
preferred embodiment, a prohormone convertase *s 
provided which is modified to have a host 
lite allowing for processing of the prohormone convertase 
precursor to active enzyme in the host cell. . e 

in one embodiment the host cell con* 
expre sea a prohormone convertase. In another .moment 

Te host i» " anS£ ° r " ed ' ^ 

^nTne aspect of the present invention the production 

20 recognizable by a host proau « of said 

. cell is dependent on the «°°™°J sald hoac 

polypeptide factor precursor and b ^ 
cell under conditions wherein tne po±yt> f 
Precursor is cleaved at said deavage site by t e host cell 

""Tarred "po^e^T- a host cell 
production of a aesirea if r ,.„»,-,« hv ») 

expressing a prohormone convertase is accomplished by a) 
expressing * v nucleic acid encoding a 

introducing into the host cell nucleic a 
30 desired polypeptide; and b, culture said host cell un 
Editions wherein said f^^^^^le 
Preferably the desired polypeptide is a poiypept 

- polypeptide >™ " ^ 

any polypeptide hormone comprrsed of^wo 

35 peptide cha^ - ^ rowth f accors , or 

? "Tong tarred prohormones are pro = . 
^elaxin. and precursors * ^ 
or II. The -prohormone may be a pronormone muu 

12 



WO 93/11247 PCT/US92/10621 

to contain one or more prohormone convertase cleavage 
sites. The prohormone or prohormone mutant may be 
processed by the prohormone convertase in an in vivo or in 
vitro manner. 

5 The method of producing polypeptide hormones may be 

further accomplished by inserting into a prohormone, a 
prohormone convertase cleavage site that facilitates 
processing by a prohormone convertase. The preferred 
hormone is a mammalian polypeptide hormone comprised of two 

10 or more polypeptide chains, for example insulin, relaxin, 
insulin-like growth factor I or insulin like growth factor 
II. 

In another aspect, the method may be practiced using 
cells transformed to contain a prohormone convertase fused 

15 to a hydrophobic transmembrane Golgi anchor, such as that 
from kex2 or furin. 

The present invention discloses nucleic acid (a) 
encoding murine prohormone convertases 1 and 2, (b) mutants 
of prohormone convertase that contain inserted convertase 

20 cleavage sites, and (c) mutants of prohormone convertase 
that contain hydrophobic anchor domains, or heterologous 
pre and prepro sequences from other processed polypeptides. 
Also disclosed are vectors containing such nucleic acid and 
cells expressing such nucleic acid. Methods of effecting 

25 transformation and methods of providing transformed 
mammalian cells are disclosed. 

nasffrinHon of the Figures 

30 FIGURE 1 , ,„ 

AtT20 murine prohormone convertase 1 cDNA iseq. 



35 



FIGURE 2 

AtT20 murine prohormone convertase 2 cDNA (Seq. ID #2) 
FIGURE 3 

Murine PCI amino acid (Seq. ID. S3 ) 



FIGURE 4 

40 Murine PC2 amino acid (Seq. ID. #4) 
FIGURE 5 

Relaxin dibasic cleavage site mutants 
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IrMrn -' »«"««cfl K m nP« . l»«m 

The term 'desireW pjIJl»|Jl«»i"™" 
polypeptide intended to be produced in . host cell, but 
. IkZ the host ceU either ^J-^TZZ 

^lypeptks include a polypeptide having as few as about 
ST aLo acids as well as much larger proteins sue as 
x, factor^!!! (2332 amino acids,. Desired polypeptides also 
Include any molecule having a pre- or prepro-am.n= , acxd 
e,»ence. as well as amino acid or glycosylatxon variants 
(including natural alleles, capable of ' 
bioloVcal activity in common with the 
15 Preferably the polypeptide is hecerologous to the hos cell 
in which it is made and is a human polypeptide. p " £ «" a 
Polypeptides are those that have pharmaceutical utiUty. 

hiss r^r:*:^ 

l^ing'normone, glucagon, factor ^ 
bombesin, factor XX. thrombin, nemop^iet.c growth, fac o 
tu mor necrosis factor-alpha and -b.ta, 

w a prF and bFGF; epidermal growun 

and TGF-beca, insulin-like growth factor-I and II. 
■ ^hroPOietin osteoinductive factors, an 
I^terferon-alpha. -beta, and -gamma, nerve 
!1f, BNDF. andNT-3, colony -stimulating factors (CSFs, . 
T JSf om-csf. andG-CSF, interleufcins (»). e.g.. 
a I' 1X 2 lT-3. il-4. etc., decay accelerating factor, 
atrial^atJiuretic peptides A. B or c, and fragments of any 
' of tnt atove-Hsted polypeptides. In addition, one or more 
p'ele erSled amino acid residues in the Polypeptide may be 




10 



WO 93/11247 PCT/USW/10621 

substituted, inserted, or deleted, for example, to produce 
products with improved biological properties, or to vary 
expression levels, such as adding or deleting a dibasic 
amino acid site susceptible to cleavage by host cell 
5 enzyme. Some of the desired polypeptides falling within 
this present invention may optionally possess covalent or 
non-covalent modifications of features of a naturally 
occurring molecule. for example, glycosy lation 

modifications. 

"Desired polypeptide precursor mutant" refers to a 
desired polypeptide precursor comprising a non-naturally 
occurring enzyme cleavage site as well as a desired 
polypeptide precursor mutant optionally having amino acid 
substitutions, deletions, and/or insertions at certain 
15 other positions provided that the final construct possesses 
the desired activity. The desired polypeptide precursor 
mutants of the present invention also include those mutants 
wherein glycosylation or other features of a naturally 
occurring molecule have been modified covalently or non- 
20 covalently provided that the final mutant construction 
possesses the desired activity. 

As used herein, -polypeptide factor.* refers to any 
protein necessary for the survival or growth of a host cell 
in culture. The polypeptide factor may be a hormone, 
25 growth factor, peptide hormone, autocrine factor, transport 
protein, oncogene/proto- oncogene and the like. Examples 
of polypeptide factors that are hormones are. for example, 
insulin, follicle stimulating hormone (FSH) . calcitonin, 
leutinizing hormone (LH) . glucagon, parathyroid hormone 
30 (PTH), thyroid stimulating hormone (TSH). thyroid releasing 
hormone (TRH), and growth hormone. Additional examples of 
polypeptide factors are the transport proteins, such as, 
transferrin, serum albumin, ceruloplasm, low density 
lipoprotein (LDL) and high density lipoprotein (HDL) . Some 
35 polypeptide factors, often are described as autocrine 
because, in some instances, the cells they are secreted 
from can respond to the secreted factor; example of "such 
factors are interleukin-2, epidermal growth factor (EGF) , 
fibroblast growth factor (FGF). thrombin, nerve growth 

15 
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£ actor. hemopoietic growth ■ t>"°**^ T ^XZ 
macrophage colony stimulating factor .GH-CSr. . 



ta.y e t;uxv**jr ** * - v ' f 



XfilSL 



the expression of certain oncogenes/proto-oncogenes. The 
s proteins encoded by these proto-oncogenes * 
the polypeptide factors of this invention are growth 
a tors transducing proteins and membrane receptor 
^es of a orowth factor is PDGF Cb sub^ encoded » 
the sis oncogene. Examples of peripheral membrane protein, 
„ «e the truncated cell surface receptor for EGF encoded by 
10 are cne «.r-«p/csF-l encoded by 

erb-B. the cell surface receptor for M-CSF/CSF en 
fms and the receptors encoded by neu and ros. » ^example 
of a transducing protein is tyrosine Kinase at the inner 
surface of the plasma-membrane encoded by abl. ^J*" 
1S poiypeptide factors are typically not added to a culture 
ZTJ. they may be substituted for another 
factor The definition of polypeptide factor includes 
amino a"* mutants which maintain the functional 
cnarleristics of the polypeptide factor. These mutants 
M may comprise one or more amino -^T^^T 
overa ll seance an ^ J. ^ ^ 
substitutions, and/or insertions recom binant 
in the overall sequence. Through the use r 
» technology polypeptide factor mutants may be prepared 
, s bv altering the underlying m*. All such variations or 
le™ in the structure of a polypeptide factor mutant 
is included within the scope of this invent.cn so long as 
the functional activity is maintained. 

■polypeptide factor precursor mutant refers to 
3, polypeptide factor precursor comprising a non-nacurally 
occurring enryme cleavage site. 

•Polypeptide factor-dependent host cell refers to 

host cellaring one or more t^^^J^ 
Culture medium for growth or survival. The polypeptide 
,s fa or s^or a particular host cell is determined using 
Lthods known to the ordinarily skilled artisan. . 
EEL^r P*~«. -tor from the medium may 
result in death of the cell or in inhibited growth. 
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•Heterologous polypeptide factor- is defined as a 
polypeptide factor not naturally occurring in the host cell. 

A "heterologous* element is defined herein to mean 
foreign to the cell, or (unless otherwise specified) 
5 homologous to the cell but in a position within the host 
cell in which the element is ordinarily not found. An 
•endogenous" element is defined herein to mean naturally 
occurring in the cell. An "endogenous host cell enzyme is 
defined to mean an enzyme which is endogenous to the cell. 
10 "Prohormone* refers to a hormone precursor. 

"Prohormone convertase" enzymes are specialized 
proteinases having conserved active site domains which are 
substrate specific and cleave exclusively at certain sets of 
basic residues, preferably Lys-Arg. Arg-Arg. Lys-Lys and 
15 Arg-Lys. This type of proteinase is responsible for 
processing large precursor proteins, such as prorelaxin. to 
their biologically active form. In the present invention 
prohormone convertase is used to refer to mammalian PCI and 
PC2, furin, and includes mammalian, yeast, or any prohormone 
20 convertase that is biologically active as described above. 
The yeast enzyme, Kex2. is specifically excluded from the 
definition of prohormone convertase enzymes. In the present 
invention 'prohormone convertase cleavage site" refers to a 
cleavage site recognized by a prohormone convertase. 
25 "Basic residue" refers to amino acids in which the R 

groups have a net positive charge at pK 7.0, for example: 
Lysine, Arginine, Histidine. 

"Dibasic cleavage site" contains two amino acids with 
basic charges on their side chains that are specifically 
30 cleaved by Iprohormone convertases. the preferred amino 
acids are Lys-Arg, Arg-Arg. Lys-Lys and Arg-Lys. 

•Kex2" refers to a Saccharomyces yeast endoprotease 
which specifically processes a mating type factor and a 
killer factor. Kex2 contains an amino terminal catalytic 
35 aoraain followed by a cysteine-rich region, a transmembrane 
domain, and a short cytoplasmic tail. 

Turin" refers to .a protein encoded by the gefte fur 
which has homology to the Kex2 protein and is involved in 
processing of protein precursors. Furin is expressed 

17 



WO 93/11247 

constitutive* in most cell types including CHO. HepG2. 

if \m iTIiiifiiii m iiiTwiii iMimwi i mi ■ " ir 

cleaving Lys-Arg residues. Like Kex2, human furxn contains 
5 an amino terminal catalytic domain followed by a cysteine- 
rich region, a transmembrane domain, and a short cytoplasmic 

tai1 ' -Mammalian PCI (mPCl)' refers to the sequence 
identified in Fig 4, and all mammalian prohormone 
10 converses having equal to or greater than 95% homology to 
the sequence in Fig. 4. 

-Mammalian PC2 (mPC2>- refers to the sequence 
identified in Fig. 5 and all mammalian prohormone convertases 
having equal to or greater than 95% homology to the sequence 

m Fig A 5 ; cieavage s . ce recognizable by a host cell enzyme- 
refers to the cleavage site of an enzyme naturally occurring 
in a host cell The preferred naturally occurring enzyme of 
the present invention is a prohormone convertase. 
20 -Prohormone convertase precursor mutant- refers to a 

. prohormone convertase precursor comprising a non-naturaiiy 
occurring enzyme cleavage site as well as desired 
polypeptide precursor mutants optionally having 
Lbstitutions, deletions, and/or insertions at certain other 
2S positions provided that the final construct possesses the 

desired activity. 

-Precursor- means a form of the protein which may be 
converted into the desired protein by processing. It may be 
a natural pro-form or prepro-form of the protein, or a 
30 synthetic pro-form wherein a gene coding for the desired 
protein is preceded by a heterologous signal, or leader 
sequence, and such as constructs which are routinely 

produced in vitro. 

M, -isolated- polypeptide means polypeptide which has 
,s been identified and separated and/or recovered from a 
component o £ its natural environment. 
components o£ its natural environment are materials which 
would interfere with diagnostic or therapeutic uses for the 
polypeptide, and may include proteins, hormones, and other 
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substances. In some embodiments, the polypeptide will be 
purified (1) to greater than 95% by weight of protein as 
determined by the Lowry method, and most preferably more 
than 99% by weight. (2) to a degree sufficient to obtain at 

5 least 15 residues of N-Terminal or internal amino acid 
sequence by use of a spinning cup sequenator. or (3) to 
homogeneity by SDS-PAGE using Coomassie blue or, 
preferably, silver stain. This definition specifically 
includes polypeptides present in situ in a host cell 

10 wherein the host cell in its native form lacks the 
polypeptide. Ordinarily, however, isolated polypeptide 
will be prepared by at least one purification step. in 
preferred embodiments, isolated prohormone convertase is 
utilized. 

15 -Nucleic acid" refers' to a nucleotide sequence 

comprising a series of nucleic acids in a 5' to 3' 
phosphate diester linkage that may be either an RNA or a 
DNA sequence. If DNA, the nucleotide sequence is either 
single or double stranded. polypeptide-encoding nucleic 

20 acid is RNA or DNA. that encodes a biologically or 
antigenically active polypeptide, is complementary to 
nucleic acid sequence encoding such polypeptide, or 
hybridizes to nucleic acid sequence encoding such 
polypeptide and remains stably bound to it under stringent 

25 conditions. 

-Isolated" polypeptide nucleic acid is a nucleic acid 
that is identified and separated from at least one 
contaminant nucleic acid with which it is ordinarily 
associated in the natural source of the nucleic acid. 

30 Isolated nucleic acid is other than in the form or setting 
in which it is found in nature. Isolated prohormone 
convertase nucleic acid therefore distinguishes prohormone 
convertase nucleic acid as it exists in natural cells. 
However, isolated prohormone convertase nucleic acid 

35 includes prohormone convertase in ordinarily prohormone 
convertase-expressing cells where the nucleic acid is. for 
example, in a chromosomal location different from that of 
natural cells. 
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-Hydrophobic transmembrane anchor" is the hydrophobic 

„f MWM8 localized in the golgi, RER or other 

cellular membranes whicn racTTrcaces-iurarfwrow"^ 
enzyme in that region and reduces the loss of enzyme from 
s the cell, specifically illustrating such anchors are the 
kex2 and furin transmembrane anchors. 

The term -mutant- as used herein is defined as a 
molecule in which the amino acid sequence, glycosylate, or 
other feature of a naturally occurring molecule has been 
10 modified covalently or noncovalently and is intended to 
include variants. Some of the variants falling within this 
. invention possess amino acid substitutions or additions of a 
host cell enzyme cleavage sites and optionally also 
substitutions, deletions, and/or insertions at certain other 
15 positions provided that the final construct possesses the 

desired activity. 

The term -host cell- refers to those cells capable of 
growth in culture and capable of expressing a prohormone 
convertase. and / or a desired protein and/or a polypeptide 
20 factor(s). The host cells referred to in this disclosure 
encompass cells in in vitro culture as well as cells that 
are within a host animal. While the preferred host cells in 
in vitro culture of this invention are mammalian cells, 
other cells may be used. Some of the embodiments of this 
25 invention involve the use of a host cell enzyme which itself 
requires processing from a precursor to an active form; for 
those host cells, such as bacterial or yeast, that lack the 
enzyme required to process the host cell enzyme, such enzyme 
may be introduced into the host cell to facilitate practice 
30 of the methods of this invention. 

The expressions -cell." and -cell culture" are used 
interchangeably and all such designations include progeny 
and ancestors. It is also understood that all progeny may 
not be precisely identical in DNA content, due to 
35 deliberate or inadvertent mutations. Mutant progeny that 
have the same function or biological activity as screened 
for in the cell are included. Where distinct designations 
are intended, it will be clear from the context. 
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•Transfection" refers to the taking up of an 
expression vector by a host cell whether or not any coding 
sequences are in fact expressed. Numerous methods of 
transfection are known to the ordinarily skilled artisan 
5 for example. CaP0 4 and electroporation . Successful 

transfection is generally recognized when any indication of 
the operation of this vector occurs v/ithin the host cell. 

■Transformation" means introducing DNA into an 
organism so that the DNA is replicable, either as an 

10 extrachromosomal element or by chromosomal integrant. 
Depending on the host cell used, transformation is done 
using standard techniques appropriate to such cells. The 
calcium treatment employing calcium chloride, as described 
in section 1.82 of Sambrook ec al . . supra, is generally 

15 used for prokaryotes or other cells that contain 
substantial cell-wall barriers. Infection with 
Agroiacterium cumefaciens is used for transformation of 
certain plant cells, as described by Shaw et al.. GSZZ. 21'. 
315 (1983) and WO 89/05859 published 29 June 1989. For 

20 mammalian cells without such cell walls, the calcium 
phosphate precipitation method described in sections 16.30- 
16.37 of Sambrook et al, supra, is preferred. General 
aspects of mammalian cell host system transformations have 
been described by Axel in U.S. 4.399,216 issued 16 August 

25 1983. Transformations into yeast are typically carried out 
according to the method of Van Solingen et al.. J , BacC . . 
130 ; 946 (1977) and Hsiao et al., pror NfiU ftrflrt ■ Sc i. 
UlSjy.. 3829 (1979). However, other methods for 

introducing DNA into cells such as by nuclear injection, 

30 electroporation. or by protoplast fusion may also be used. 
' -Stably transformed host cell" refers to a cell 
wherein, the' inserted nucleic acid is present either 
integrated in the host cell or extrachromosomally . and 
wherein the inserted nucleic acid is continuously produced 

35 by the cell for about two weeks after insertion of nucleic 
acid. General aspects of mammalian cell host system 
transformations have been described by Axel in U.S. 
.4,399,216 issued 16 August 1983. 

21 



W093/HW 

The term -medium- refers to the aqueous environment in 
mLalian cells are grown in cuiture. The ^ 

™ * J- ~ ~ r r 

5 „ the addition o £ nutritional and growth factors necessary 
growth or survival. -Serum-free medium- 
medium lacWng serum. The hordes growth ^to- 
transport proteins, peptide hordes and the like typically 
found in serum which are necessary for the 
u growth of particular cells in culture „ « 
as a supplement to serum-free medium. A aen 
refers to a medium comprising nutritional and hormonal 
"dements necessary for the survival and growth of 
c eUs in culture such that the components of the medium are 
„ L„n. A defined medium provided by the method o the 
instant invention establishes a local environment 
particular host cell £c *- -~ 

environment of the medium. Cells -n ootira al 
generally retire insulin and transfe rrm optimal 
„ growth. These two factors should b te 

. determining what factors a given cell requires. Mo 
Unas require one or -a oj -e -wth = . T^ 
include epidermal growth factor (EGF1 . tl 
factor <PG fl . insulin-liXe growth factors I and II IGF 
25 nerve growth factor W. etc 

« which may be necessary include: cransporc 

finlg pIIL Te,.. ceruioplasmin, high and low density 

Upoprotein MM. ^- n ' "nybridi.ation 

•Stringent conditions" are those n> 

3„ conditC:: that U, employ low ionic 

ssras^^^ ^ teins 

hvbridization a denaturing agent such as -formamide. for 
hybridization ^ ^ ^ SOTm 

T nio/l* F ToU/om polyvinylpyrrolidone/50 mM sodium 
,S albumin/0/1% Ficoll/O/ P ^ ^ ^ 

phosphate buffer at pH b.s wicn 

\ .r 42' C- or (3) employ 504 formamide. 5 * SSC 
TlT» »*l 0.075 . sodium citrate,. 50 m* sodium 
phos P h«e U "«» Phosphate. .5 , 
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Denhardt's solution, sonicated salmon sperm DNA (50 g/ml), 
0.1% SDS, and 10% dextran sulface at 42 C, with washes at 
42 C in 0.2 x SSC and 0.1% SDS. 

"Recovery" or -isolation" of a given fragment of DNA 
5 from a restriction digest means separation of the digest on 
polyacrylamide or agarose gel by electrophoresis, 
identification of the fragment of interest by comparison of 
its mobility versus that of marker DNA fragments of known 
molecular weight, removal of the gel section containing the 

10 desired fragment, and separation of the gel from DNA. This 
procedure is known generally. For example, see Lawn et 
a2., Nucleic Acids Res . . £: 6103-6114 (1981), and Goeddel 
et al. § Wnpleic Ac ids Res. £: 4057 (1980). 

"Polymerase chain reaction, - or "PCR," as used herein 

15 generally refers to a procedure wherein minute amounts of a 
specific piece of nucleic acid, RNA and/or DNA, are 
amplified as described in U.S. Pat. No, 4,683,195 issued 28 
July 1987. Generally, sequence information from the ends 
of the region of interest or beyond needs to be available, 

20 such that oligonucleotide primers can be designed; these 
primers will be identical or similar in sequence to 
opposite strands of the template to be amplified. The 5* 
terminal nucleotides cf the two primers may coincide with 
the ends of the amplified material. PGR can be used to 

25 amplify specific RNA sequences, specific DNA sequences from 
total genomic DNA, and cDNA transcribed from total cellular 
RNA, bacteriophage or plasmid sequences, etc. See 
generally Mullis et al . . Cold Sori no Harbor Svmo. Quant, 
Biol . . 51 : 263 (1987); Erlich, ed . , ££E Technology* 

30 (Stockton Press, NY, 1989). As used herein, PGR is 
considered to be one, but not the only, example of a 
. nucleic acid polymerase reaction method for amplifying a 
nucleic acid test sample, comprising the use of a known 
nucleic acid as a primer and utilizing a nucleic acid 

35 polymerase to amplify or generate a specific piece of 
nucleic acid. 

"Oligonucleotide-mediated mutagenesis* refers to a 
method for preparing substitution, deletion, and insertion 
mutants of prohormone convert ase 1 or 2 encoding DNA. This 
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r„chnioue is well known in the arc as. described by Adelman 
^ i: .1MUM3.. Briefly, the prohormone 



convertase 1 or 2 -D(m-T.-s J WW— » - 
oligonucleotide encoding the desired station to a vm 
5 template, where the template is the single-stranded f» of 
I plasmid or bacteriophage containing the unaltered or 
native DNA sequence of the prohormone convertase l ot 2. 
After hybridization, a a» polymerase is used to 
an entire second complementary strand of the template that 
U will thus incorporate the oligonucleotide primer, and will 
Le for the selected alteration in the prohormone 
convertase 1 or 2. Generally, oligonucleotides of at least 
25 nucleotides in length are used. » opt«l 
oligonucleotide will have U to 15 nucleotides that are 
U completely complementary to the template on either side of 
the : nucleotides, coding for the mutation. This enures 
that the oligonucleotide will hybridize properly to the 
single-stranded DKA template molecule. The 
oligonucleotides are readily synthesized using technigu es 
M taown in the art such as that described by Crea et al. 
, r|| „ 1T1 ^ «■ tt. 5765 [19781). 

■ J^osylation- refers to the Post-translationa 
modification process of adding a series of sugar residues 
To. proteins to produce glycoproteins. Glycosylacion an 
J5 occur in the cytosol. the endoplasmic reticulum, or the 
Golgi apparatus of mammalian cells. 

■Zs- and -PRK7- are expression vectors used to 
' transform mammalian ceils. The construction of 

vector pRKS was described in European Patent Application 
vector v ,. M The p rk7 vector was 

30 0307247A2, published March 15, 1989. The P 
' described in European Patent Application 278.776 published 

^Z^**^ secret pathway Ve^s to the 
cellular pathway for exostosis which is regulated over 

! of time by physiological stimuli, such as 

35 short periods or time oy P«y » 

cyclic AMP. This pathway is not required for cell 
lability and is only present in specialized secretory 
cells such as .endocrine and exocrine cells. 
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-Constitutive protein secretory pathway" refers to the 
unregulated cellular pathway which is found in cells such 
as macrophages and fibroblasts. 

•Human insulin" refers to a protein exhibiting a 

5 biological activity in common with naturally occurring 
human insulin. Human insulin biological activity includes 
promoting glucose utilization, protein synthesis, and the 
formation and storage of neutral lipids. -Human 
proinsulin" is defined to be a molecule containing the B, 

10 C, and A chain amino acid sequence and includes amino acid 
variants that maintain the functional characteristics of 
insulin discussed above. These variants may comprise one 
or more amino acid differences in the overall sequence and 
may be prepared by deletions, substitutions, and/or 

15 insertions of one or more amino acids in the overall 
sequence. Through the use of techniques common in the 
field, human proinsulin mutants may be prepared by altering 
the protein itself or the nucleic acid encoding the 
protein. All such variations or alterations in the 

20 structure of human proinsulin resulting in human proinsulin 
variants are included within the scope of this invention so 
long as the functional activity of proinsulin is 
maintained. 

"Human relaxin- denotes a functional protein capable 
25 of exhibiting a biological activity. Human relaxin 
biological activity is defined as any of 1) immunological 
cross-reactivity with at least one epitope of human relaxin 
or 2) the possession of a least one hormonal function in 
common with human relaxin. "Human prorelaxin- is defined 
10 to be a molecule containing the B. C, and A chain amino 
acid sequence and includes amino acid variants which 
maintain the functional characteristics discussed above. 
These variants may comprise one or mor,e amino acid 
differences in the overall sequence and may be prepared by 
' 35 deletions, substitutions, and/or insertions of one or more 
amino acids in the overall sequence. Through the use of 
recombinant DNA technology human prorelaxin mutants may be 
prepared by altering the underlying DNA. All such 
variations or alterations in the structure of human 
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prorelaxin resulting in human prorelaxin variants are 
included within the scope of this invention so long as the 




unctioiidj 

As used herein, "IGF-I" refers to IGF-I from any 
s species, including bovine, ovine, porcine, equine, and 
preferably human, in naturally occurring-sequence or in 
variant form, or from any source, whether natural, 
synthetic, or recombinant. Variants may comprise one or 
more amino acid differences in the overall sequence and may 
10 be prepared by deletions, substitutions, and/ or insertions 
of one or more amino acids in the overall sequence. All 
such variations or alterations in the structure of IGF-I 
resulting in human prorelaxin variants are included within 
the scope of this invention so long as the functional 
15 activity is maintained. 

Use of Prohormone Convertase in Mammalian Cell 

Culture 

in the present invention, the host cells expressing a 
prohormone convertase may be used to express polypeptide 

20 factors needed for cell growth and anabolism. 

Mammalian cells frequently require one or more 
hormones from each of the following groups: steroids, 
prostaglandins, growth factors, pituitary hormones, and 
polypeptide hormones. Most cell types require insulin to 

25 survive in serum-free media. (Sato. G.H. et al. in Growth 
of Cells in Hormonally Defined Media. [Cold Spring Harbor 
Press, N.Y., 1982]). in addition to the hormones, cells 
may require transport proteins such as transferrin (plasma 
iron transport protein), ceruloplasmin (a copper transport 

30 protein) . and high density lipoprotein (a lipid carrier) to 
be added to cell media. The set of optimal hormones or 
transport proteins will vary for each cell type. Most of 
these hormones or transport proteins have been added 
exogenously or. in a rare case, a mutant cell line has been 

3S found which does not require a particular factor. 

Cellular proliferation has been studied to elaborate 
the events necessary to lead from quiescent growth arrest 
to the cellular commitment to proliferate. Various 
polypeptide factors have been found to be involved in that 
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transformation. These transformed cells have been found to 
produce peptide growth factors in culture. (Kaplan, P.L. 
et al.. PNAS 79:485-489 11982)). The secretion from a cell 
of a factor to which that same cell can respond has been 

5 referred to as an -autocrine- system. Numerous factors 
have been described as autocrine: bombesin, incerleukin-2 
(Duprez, V. et al! PNAS 82:6932 [1985]); insulin, (Serrero. 
G., In Vitro Cellular L Dev. Biol. 21191:537 [1985]); 
transforming growth factor alpha (TGF-a) , - platelet-derived 

10 growth factor (PDGF) ; transforming growth factor-beta (TGF- 
b), (Sporn, M.B. & Roberts, A.B.. Nature 313:745 [1985]); 
sarcoma growth factor (SGF). (Anzano. M.A. et al.. PNAS 
80:6264 [1983]); and, hemopoietic growth factor, 
granulocyte-macrophage colony stimulating factor CGM-CSF) . 

15 (Lang, R.A. et al., Cell 43:531 [1985]). The methods of 
the present invention are suitable for cells expressing one 
or more prohormone convertases and further comprising 
nucleic acid encoding a polypeptide factor required for 
proliferation or for maintenance of cellular integrity. In 

20 the present invention the preferred polypeptide factor is 
insulin. 

in the present invention, the host cells expressing a 
prohormone convertase may be used to express any desired 
polypeptide which requires processing from a precursor 
25 form. The cells of the present invention may be used in 
cell culture to produce any polypeptide of interest even 
those not requiring processing by a prohormone convertase. 

In the present invention nucleic acid encoding a 
prohormone convertase may be introduced prior to the 
30 introduction of nucleic acid encoding a desired polypeptide 
or a polypeptide factor. In the present invention, nucleic 
acid encoding a prohormone convertase may be introduced 
simultaneously with nucleic acid encoding a desired 
polypeptide or a polypeptide factor wherein the nucleic 
35 acid may be on the same or separate vectors. 

As described more fully herein, the present invention 
describes a method for the production of a heterologous 
polypeptide factor in a polypeptide factor-dependent host 
cell comprising a) introducing into the polypeptide factor- 
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dependent host cell nucleic acid encoding a heterologous 

Ti.riV^MMwiiimiNni 1 1 T a n ■ * Cl .^^ 

cell is dependent on the cleavage product of said 
5 polypeptide factor precursor; and b) culturing said host 
cell under conditions wherein the polypeptide factor 
precursor is cleaved at said cleavage site by the host cell 
enzyme, thereby producing said polypeptide factor. 
Optionally polypeptide factor is recovered. 
0 The present invention further describes a method for 

the production of a desired polypeptide in a host cell 
expressing a prohormone convertase comprising a) 
introducing into the hose cell expressing said prohormone 
convertase nucleic acid encoding a desired polypeptide; and 
L5 b) culturing said host cell under conditions wherein said 
desired polypeptide is expressed. Optionally the desired 
polypeptide is recovered. 

*For any of the cleavage events described herein, 
additional host cell enzyme or heterologous enzyme is added 
10 to the host cell as desired. 

The present invention discloses nucleic acid encoding 
a polypeptide factor precursor mutant comprising an 
prohormone convertase cleavage site. Another aspect of the 
present invention is a nucleic acid encoding a prohormone 
25 convertase that is joined to nucleic acid encoding a 
hydrophobic transmembrane anchor. This joined nucleic acid 
directs the synthesis of a polypeptide fusion product 
wherein the catalytically active prohormone convertase is 
covalently attached to a hydrophobic transmembrane anchor. 
30 Among the preferred hydrophobic transmembrane anchors are 
the KEX2 hydrophobic transmembrane anchor and the furin 
hydrophobic transmembrane anchor. 

The present invention also provides for a prohormone 
convertase precursor mutant having nucleic acid encoding a 
35 prohormone convertase cleavage site. 

The present invention further provides for host cells 
having nucleic acid encoding a polypeptide factor precursor 
mutant comprising a prohormone convertase cleavage site. 
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The nucleic acid and methods disclosed herein are 
suitable for use for expression in a wide range of host 
cell lines. Mammalian cells are the preferred host cell 
for expressing prohormone convertases. The host cell of 

5 the presenc invention may be a mammalian cell that does not 
normally express detectable amounts of prohormone 
convertase. In addition, the mammalian cell of the present 
invention may be a mammalian cell that produces levels of 
prohormone convertase that are insufficient to process the 

10 desired polypeptide precursor to desired polypeptide. 
Transformation of such cells by nucleic acid encoding a 
prohormone convertase results in an increase in prohormone 
convertase sufficient to process desired polypeptide 
precursor to polypeptide, 

15 Isolation of Nucleic Acid 

The nucleic acid encoding the prohormone, convertase, 
or other polypeptide of interest may be obtained from any 
cDNA library prepared from tissue or cells believed to 
possess the polypeptide mRNA and to express it at a 

20 detectable level. The desired gene may also be obtained 
from a genomic library. 

Libraries are screened with probes designed to 
identify the gene of interest or the protein encoded by it. 
For cDNA expression libraries, suitable probes include 

25 monoclonal .or polyclonal antibodies that recognize and 
specifically bind to the desired polypeptide; 
oligonucleotides of about 20-80 bases in length that encode 
known or suspected portions of the polypeptide of 
interest's cDNA from the same or different species; and/or 

30 complementary or homologous cDK-As or fragments thereof that 
encode the same or a similar gene. Appropriate probes for 
screening genomic DNA libraries include, but are not 
limited to, oligonucleotides; cDNAs or fragments thereof 
that encode the same or a similar gene; and/or homologous 

35 genomic DNAs or fragments thereof. Screening the cDNA or 
genomic library with the selected probe may be conducted 
using standard procedures as described in chapters 10-12 of 
Sambrook et al., supra. 

An alternative means to isolate the gene encoding the 
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polypeptide of interest is to use polymerase chain reaction 
(PCR) methodology as described in section 14 of Sambrook et 
^^ugEa^his^nethQdxecMi^ 
^f^g^^J?§^^ l 5^^Hy0r^.'a , i*z'e'-"efw--e , n'e-» ^^mjl^ttl$msa*s&asBm 
5 Strategies for selection of oligonucleotides are described 
below. 

Another alternative method for obtaining the gene of 
interest is to chemically synthesize it using one of the 
methods described in Engels et al. (Agnew. Chem. Int. Ed. 

10 Engl., 28: 716-734 [1989]). These methods include 
triester, phosphite, phosphoramidite and H-Phosphonate 
. methods, PCR and other autoprimer methods, and 
oligonucleotide syntheses on solid supports. These methods 
may be used if the entire nucleic acid sequence of the gene 

15 is known, or the sequence of the nucleic acid complementary 
to the coding strand is available, or alternatively, if the 
target amino acid sequence is known, one may infer 
potential nucleic acid sequences using known and preferred 
coding residues for each amino acid residue. 

20 A preferred method of practicing this invention is to 

use carefully selected oligonucleotide sequences to screen 
cDNA libraries from various tissues, depending on the 
source of the gene or polypeptide of interest. 

The oligonucleotide sequences selected as probes 

25 should be of sufficient length and sufficiently unambiguous 
that false positives are minimized. The actual nucleotide 
sequence(s) is usually based on conserved or highly 
homologous nucleotide sequences or regions of other 
polypeptides homologous to the polypeptide of interest. 

30 The oligonucleotides may be degenerate at- one or more 
positions. The use of degenerate oligonucleotides may be 
of particular importance where a library is screened from a 
species in which preferential codon usage in that species 
is not known. The oligonucleotide must be labeled such 

35 that it can be detected upon hybridization to DNA in the 
library being screened. The preferred method of labeling 
is to use 32-P labeled ATP with polynucleotide kinase*, as 
is well known in the art, to radiolabel the 
oligonucleotide. However, other methods may be used to 
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label the oligonucleotide, including, but not limited to, 
biotinylation or enzyme labeling. 

Of particular interest is prohormone convertase 
nucleic acid that encodes a full-length polypeptide, in 
5 some preferred embodiments, the nucleic acid sequence 
includes the prohormone convertase signal sequence. 
Nucleic acid having all the protein coding sequence is 
obtained by screening selected cDNA or genomic libraries, 
and, if necessary, using conventional primer extension 

10 procedures as described in section 7.79 of Sambrook et al., 
supra, to detect precursors and processing intermediates of 
mRNA that may not have been reverse- transcribed into cDNA. 

In preferred embodiments of this invention DNA 
hybridization probes are used to identify novel prohormone 

15 convertase candidate enzyme messages in cell lines 
containing regulated secretory pathways. The murine 
pituitary tumor cell line AtT-20 is suitable because it is 
neuroendocrine-derived, and secretes ACTH. It would 
therefore be expected to contain processing enzymes 

20 involved in the conversion of the prohormone 
proopiomelanocortin (POMC) precursor to mature ACTH, a 
process which requires cleavage at specific pairs of basic 
amino acids in the prohormone substrate. 

Candidate cDNAs expressed, in the AtT-20 cells may be 

25 amplified by the MOPAC procedure (Innis 1990) using 
degenerate primer sequences based on those of Smeekens and 
Steiner (1990) . In particularly preferred embodiments, two 
PC-like cDNAs are amplified and cloned from the AtT-20 
cells: mPC2 (greater than 95% homologous to the human PC2 of 

30 Smeekens and Steiner) and second sequence similar to mPC2, 
termed mPCl. 

Construction of polypeptide mutants 

Amino acid sequence variants of the prohormone 
convertase, polypeptide factor, desired polypeptide or 
35 other polypeptide of interest are prepared by introducing 
appropriate nucleotide changes into the encoding DNA, or by 
in vitro synthesis of the desired polypeptide. "Such 
variants include, for example, deletions from, or 
insertions or substitutions of, residues within the amino 
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acid sequence for the polypeptide of interest. Any 
combination of deletion, insertion, and substitution can be 
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final construct possesses the desired cnaracterrstx CS --xxx C 
5 amino acid changes also may alter post-translational 
processes of the polypeptide, such as changing the number 
or position of glycosylation sites, altering the membrane 
anchoring characteristics, and/or altering the intra- 
cellular location of the polypeptide by inserting, 
10 deleting, or otherwise affecting the leader sequence of the 
naturally occurring polypeptide of interest. 

in designing amino acid sequence polypeptide variants, 
the location of the mutation site and the nature of the 
mutation will method of labeling is to use 32-P labeled ATP 
15 with polynucleotide kinase, as is well known in the art. to 
radiolabel the oligonucleotide. However, other methods may 
be used to label the oligonucleotide, including, but not 
limited to. biotinylation or enzyme labeling, substituting 
first with conservative amino acid choices and then with 
20 more radical selections depending upon the results 
achieved*. (2) deleting the target residue, or (3) inserting 
residues of the same or a different class adjacent to the 
located site, or combinations of options 1-3. 

. There are two principal variables in the construction 
25 of amino acid sequence variants: the location of the 
mutation site and the nature of the mutation. In general, 
the location and nature of the mutation chosen will depend 
upon the characteristic to be modified. 

Amino acid sequence deletions generally range from 
30 about 1 to 30 residues, more preferably about 1 to 10 
residues/and typically are contiguous. Deletions may be 

* introduced into regions of low homology between the 
polypeptide of interest and other polypeptides to modify 

• the activity of the polypeptide of interest. Deletions 
35 from the polypeptide of interest in areas of substantial 

homology with any other polypeptide will be more likely to 
modify the biological activity of the polypeptide of 
interest more significantly. The number of consecutive 
deletions will be selected so as to preserve the tertiary 
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structure of polypeptide of interest in the affected 
domain, e.g., beta-pieated sheec or alpha helix. 

Amino acid sequence insertions include amino- and/or 
carboxyl-terminal fusions ranging in length from one 
5 residue to polypeptides containing a hundred or more 
residues, as well as intraseguence insertions of single or 
multiple amino acid residues. Intrasequence insertions 
(i.e., insertions within the sequence of the polypeptide of 
interest) may range generally from about 1 to 10 residues, 

10 more preferably 1 to 5, most preferably 1 to 3, An example 
of a terminal insertion is an N-terminal methionyl residue, 
an artifact of the direct expression of a polypeptide in 
bacterial recombinant cell culture. 

Other insertional variants of the polypeptide of 

15 interest include the fusion to the N- or C-terminus of the 
polypeptide of immunogenic polypeptides, e.g., bacterial 
polypeptides such as beta-lactamase or an enzyme encoded by 
the E. coli trp locus, or yeast protein, and C-terminal 
fusions with proteins having a long half-life such as 

20 immunoglobulin constant regions (or other immunoglobulin 
regions), albumin, or ferritin, as described *in VJO 89/02922 
published 6 April 1989. 

Another group of variants are amino acid substitution 
variants. These variants have at least one amino acid 

25 residue in the polypeptide of interest removed and a 
different residue inserted in its place. The sites of 
greatest interest for substitutional mutagenesis include 
sites identified as the active site(s)« of the polypeptide 
of interest, and sites where the amino acids found in 

30 homologous polypeptides from various species are 
substantially different in terms of side-chain bulk, 
charge, and/or hydrophobicity . 

DNA encoding amino acid sequence variants of the beta- 
8 integrin subunit is prepared by a variety of methods 

35 known in the art. These methods include, but are not 
limited to, isolation from a natural source (in the case of 
naturally occurring amino acid sequence variants*) or 
preparation by oligonucleotide-mediated (or site-directed) 
mutagenesis, PCR mutagenesis, and cassette mutagenesis of 

33 



10 



15 



20 



25 



30 



35 



an earlier prepared variant or a" non-variant version of the 
beca-8 integrin subunit. These techniques may utilized 

nucleic acid complementary to ?CTIa^SiiP™IP^ 

nucleic acid. In the present invention, the site-directed 
mutagenesis method of Kunkel. M n rUn^ Pf mrynolC-gV . 
154-367-383, 1987 are particularly preferred. 

" Another method for preparing the prohormone convercase 
mutants of this invention are preferably constructed by 
mutating the nucleic acid sequences that encode the native 
prohormone convertase. Generally, particular regions or 
sites of the nucleic acid will be targeted for mutagenesis, 
and thus the general methodology employed to accomplish 
this is termed site-directed mutagenesis, in the present 
invention the preferred method of site-directed mutagenesis 
is by the method of Kunkel QTrrhnrt* of FnrvmPtoqv 154:357- 
382, 1987). 

Another method for preparing mutants of. this invention 
is by oligo-mediated mutagenesis, oligonucleotide-mediated 
mutagenesis is a preferred method for preparing 
substitution, deletion, and insertion mutants of prohormone 
convertase 1 or 2 encoding DNA. This technique is well 
known in the art as described by Adelman et al., BUA, A: 
183 (1983). Briefly, the prohormone convertase 1 or 2 DNA 
is altered by hvbridizing an oligonucleotide encoding the 
desired mutation to a DNA template, where the template is 
the single-stranded form of a plasmid or bacteriophage 
containing the unaltered or naturally occurring DNA 
sequence of the prohormone convertase 1 or 2. After 
hybridization, a DNA polymerase is used to synthesize an 
entire second complementary strand of the template that 
will thus incorporate the oligonucleotide primer, and will 
code for the selected alteration in the prohormone 

convertase 1 or 2. ^ iAa€ , 
Generally, oligonucleotides of at least 25 nucleotides 
in length are used. An optimal oligonucleotide will have 
12 to 15 nucleotides that are completely complementary to 
the template on either side of the nucleotide (s) coding for 
the mutation. This ensures that the oligonucleotide will 
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hybridize properly to the single-stranded DNA template 
molecule. The oligonucleotides are readily synthesized 
using techniques known in the art such as that described by 
Crea et al. ^^r. MaM , ftp*^- -^i- ug A. 21: 5765 [1978]). 
5 PCR mutagenesis is also suitable for making amino acid 

mutants While the following discussion herein may refer to 
DNA, it is understood that the technique also finds 
application with RNA. The PCR technique refers to the 
procedure outlined in ftenetie Encineering Vol 12 pp 115-137, 
10 [1990] by Arnheim; Analytica l Phpmistrv, vol 62 pp 1202-1214 
[1990] by Gibbs; and science vol 252 pp. 1643-1651 [1991] by 
Gel f and 

Mutants Affecting Polypeptide precursor processing 

One aspect of the present invention is a polypeptide 

15 factor precursor mutant containing modification within the 
nucleic acid encoding pre-pro or pro sequence which 
optimizes the precursor processing by the prohormone 
convertase. Cells vary widely in their processing 
capability, and consequently, in the prohormone maturation 

20 products that are ultimately secreted from them. This 
difference in processing of polypeptide factor precursor 
mutants could be the result of several factors: (a) the 
precursor sequence itself could influence the accessibility 
of the prohormone convertase to. the cleavage site, (b) each 

25 specific pattern could represent a distinctly different 
processing enzyme, (c) there could be a limited number of 
prohormone convertases that exhibit specificity due to 
their location (microenvironment ) or relative abundance 
within the cell, (d) similar precursors could be modified 

30 in the various cell types influencing their accessibility 
to the prohormone convertase. 

In the present invention, one preferred polypeptide 
factor precursor mutant is a human proinsulin mutant having 

. ■ ■ a prohormone convertase cleavage site wherein the cleavage 

35 site is processed in the constitutive pathway of the host 
cell. The preferred proinsulin variant is one comprising a 
residue substitution for naturally occurring residues in 
the Type I and/or Type II cleavage sites as outlined: 
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Proinsulin residue Lys 29 is substituted with any of a 
residue selected from the group of Arg, His. Cys, Met. Phe. 
^ ^^J^^^a^^^^.- Ser. Thr, Asp, Glu, 

o!5^Asn^an^inos^prer"eraoiy~>a:gT- 

proinsulin residue Arg 31 is substituted with any of a 
residue selected from the group of Lys, His. Cys. Met, Phe, 
Tyr, Trp. Pro. Gly, Ala. Val. lie, Leu, Ser, Thr, Asp, Glu, 
Gin, Asn. and most preferably Lys. 

proinsulin residue Leu 62 is substituted with any of a 
residue selected from the group of Arg, His, Cys. Met, Phe, 
Tyr, Trp. Pro. Gly. Ala, Val, lie, Lys, Ser. Thr. Asp. Glu, 
Gin, Asn, and most preferably Lys or Arg. 

Site-directed mutagenesis by the method of Kunkel 
{Kunkel 1987) is a preferred method for preparing 
substitution, deletion, and insertion variants of 
polypeptide precursors. 

A preferred mammalian prohormone convertase 1 (mPCl) 
precursor mutant comprises a residue substitution for 
naturally occurring residues in the prohormone, convertase 
cleavage site of the precursor as outlined: mPCl precursor 
residue Lys 80 is substituted with any of a residue 
selected from the group Arg. His. Cys, Met, Phe, Tyr, Trp, 
Pro, Gly. Ala. Val, He. Leu, Ser, Thr, Asp, Glu, Gin. Asn, 
and most preferably Arg. 

A preferred mammalian prohormone convertase 2 <mPC2) 
precursor mutant comprises a residue substitution for 
naturally occurring residues in the prohormone convertase 
cleavage site of the precursor as outlined. mPC2 precursor 
residue Lys 77 is substituted with any of a residue 
selected from the group Arg, His, Cys. Met, Phe, Tyr, Trp, 
Pro, Gly, Ala. Val. lie. Leu. Ser, Thr. Asp, Glu, Gin, Asn, 
and most preferably Arg. mPC2 precursor residue Arg 78 is 
substituted with any of a residue selected from the group 
Lys, His, Cys, Met, Phe. Tyr, Trp, Pro, Gly, Ala, Val, He, 
Leu, Ser, Thr, Asp, Glu, Gin. Asn, and most preferably Ala. 
mPC2 precursor residue Arg 79 is substituted with any of a 
residue selected from the group Lys, His, Cys, Met. 'Phe, 
Tyr. Trp, Pro, Gly. Ala, Val, He, Leu, Ser. Thr. Asp, Glu, 
Gin, Asn, and most preferably Lys. 
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Insertion of DNA into a Cloning Vehicle 

The cDNA or genomic DNA is inserted into a replicable 
vector for further cloning (amplification cf the DNA) or 
for expression. Construction of suitable, vectors 
5 containing the desired coding and control sequences employs 
standard recombinant techniques. Isolated plasmids or DNA 
fragments are cleaved, tailored, and religated to form the 
desired plasmid. 

Many vectors are available, and selection of the 
10 appropriate vector will depend on 1) whether it is to be 
used for DNA amplification or "for DNA expression. 2) the 
size of the DNA to be inserted into the vector, and 3) the 
host cell to be transformed with the vector. Each vector 
contains various components depending on its function 
15 (amplification of DNA or expression cf DNA) and the host 
cell for which it is compatible. The vector components 
generally include, but are not limited to. one or more of 
the following: a signal sequence, an origin of replication, 
one or more marker genes, an enhancer element, a promoter, 
20 and a transcription termination sequence. 

The preferred replicable vector is pRK5 or pRK7 . 
Signal Sequence Component 

in general, the signal sequence may be a component of 
the vector, or it may be a part of the prohormone 
25 convertase DNA that is inserted into the vector. For 
example the native prohormone convertase DNA encodes a 
signal sequence at the amino terminus (5- end of the DNA) 
of the polypeptide that is cleaved during post- 
radiational processing of the polypeptide to form the 
30 mature prohormone convertase polypeptide. Included within 
" the scope of this invention are polypeptides with the 
native signal sequence deleted and replaced with a 
heterologous signal sequence. The heterologous signal 
sequence selected should be one that is recognized and 
35 processed (i.e. cleaved by a signal peptidase) by the host 
cell. in mammalian cell expression the native signal 
sequence is generally satisfactory, although other 
mammalian signal sequences may be suitable. 
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Origin of Replication Component 

Most expression vectors are -shuttle" vectors, i.e. 

or^^but can be transfected into SSS^gS^^ 
S expression. For example, a vector is cloned in B. coli and 
then the same vector is transfected into yeast or mammalian 
cells for expression even chough it is not capable of 
replicating independently of the host cell chromosome. 

DNA may also be amplified by insertion into the host 
ie genome. This is readily accomplished using Bacillus 
species as hosts, for example, by including in the vector a 
DNA sequence that is complementary to a sequence found m 
Bacillus genomic DNA. Transfection of Bacillus with this 
vector results in homologous recombination with the genome 
IS and insertion of the prohormone convertase subunit DNA. 
Selection Gene Component 

Expression and cloning vectors should contain a 
selection gene, also termed a selectable marker. This gene 
encodes a protein necessary for the survival or growth of 
20 * transformed host cells grown in a selective culture medium. 
Host cells not transformed with the vector containing the 
selection gene will not survive in the culture medium. 
Typical selection genes encode proteins that (a) confer 
resistance to antibiotics or other toxins, e.g. ampicillm, 
25 neomycin, methotrexate, or tetracycline, (b) complement 
auxotrophic deficiencies, or (c) supply critical nutrients 
not available from complex media. 

One example of a selection scheme utilizes a drug to 
arrest growth of a host cell. Those cells that are 
30 successfully transformed with a heterologous gene express a 
protein conferring drug resistance and thus survive the 
selection regimen. Examples of such dominant selection use 
the drugs neomycin (Southern et al.. , T Molf , Appl. 
QsaeS^, 1: 327 119821). mycophenolic acid (Mulligan et al.. 
35 pHanca . 2^: 1422 U980]) or hygromycin (Sugden et al., 
w„i r.ii. Biol. . £: 410-413 [1985]). The three examples 
given above employ bacterial genes under eukaryotic cdntrol 
to convey resistance to the appropriate drug G418 or 
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neomycin (geneticin). xgpr. (mycophenolic acid), or 
hygromycin, respectively. 

An example of suitable selectable markers for 
mammalian cells are those that enable the identification of 
5 cells competent to take up the prohormone convertase 
nucleic acid, such as dihydrof olate reductase (DHFR) or 
thymidine kinase. The mammalian cell transf ormants are 
placed under selection pressure which only the 
transformants are uniquely adapted to survive by virtue of 
10 having taken up the marker. Selection pressure is imposed 
by culturing the transformants under conditions in which 
the concentration of selection agent in the medium is 
successively changed, thereby leading to amplification of 
both the selection gene and the DNA that encodes the 
15 polypeptide of interest. Amplification is the process by 
which genes in greater demand for the production of a 
protein critical 'for growth are reiterated in tandem within 
the chromosomes of successive generations of recombinant 
cells. Increased quantities of the polypeptide of interest 
20 are synthesized from the amplified DNA . 

For example, cells transformed with the DHFR selection 
gene are first identified by culturing all of the 
transformants in a culture medium that contains 
methotrexate (Mtx) , a competitive antagonist of DHFR. An 
25 appropriate host cell when wild-type DHFR is employed is 
the Chinese hamster ovary (CHO) cell line deficient in DHFR 
activity, prepared and propagated as described by Urlaub 
and Chasin. »™r Natl. Aral. Sci. VSA, 22= 4216 11980] . 
The transformed cells are then exposed to increased levels 
3 p of methotrexate. This leads to the synthesis of multiple 
. •' copies of ' the DHFR gene, and, concomitantly, multiple 
copies, of other SNA comprising the expression vectors, such 
as the DNA encoding the prohormone convertase. This 
amplification technique can be used with any otherwise 
35 suitable host. e.g.. 293 or CHO cells. ATCC No. CCL61 CHO- 
Kl. notwithstanding the presence of endogenous DHFR if. for 
example, a mutant DHFR gene that is highly resistant to Mtx 
is employed (EP 117 .-060). Alternatively, host cells 
(particularly wild-type hosts that contain endogenous DHFR) 
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transformed or co- transformed with DNA sequences encoding 

DHFR r otein ii a ° 
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phosphotransferase (APH) can be selected by cell growth xn 
medium containing a selection agent for the selectable 
marker such as an aminoglycoside antibiotic, e g. , 
kanamycin, neomycin, or G418. See U.S. Pat. No. 4,965.199. 
Promoter Component 

Expression and cloning vectors usually contain a 
promoter that is recognized by the host organism and is 
operably linked to the nucleic acid encoding the 
polypeptide of interest. Promoters are 
sequences located upstream ,5-1 to eh. start code* , o a 
structural gene (generally within about 100 to 1000 bp) 
that control the transcription and translation of 
particular nucleic acid sequence, such as that encoding a 
prohormone convertase. to which they are operably linked, 
such promoters typically fall into two classes, inducible 
and constitutive. Inducible promoters are promoters that 
» initiate increased levels of transcription from DNA under 
. their control in response to some change in culture 
conditions, e.g. the presence or absence of a nutrient or a 
change in temperature. At this time a large 
promoters recognized by a variety of potential 
u are well known. These promoters are operably linked to » 
encoding the polypeptide of interest by removing the 
promoter from the source MA by restriction enzyme 
digestion and inserting the isolated promoter sequence into 
Che vector. Both the native promoter sequence and many 
3. heterologous promoters may be used to direct -P""»"° n 
and/or expression of the polypeptide of interest. However 
heteroiogous promoters are preferred, as they 
permit greater transcription and higher yields o. expressed 
polypeptide of interest as compared to the native 
3S P^=rmone convertase promoter- eultaryot es. 
promoter sequences are Known 
Virtually all eukaryotic genes have an AT-rich region 
located approximately 25 to" 30 bases upstream from the sxte 
where transcription is initiated. Another sequence found 
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70 co 80 bases upstream from the start of transcription of 
many genes is a CXCAAT region where X may be any 
nucleotide. At the 3' end of xos: eukaryctic genes is an 
AATAAA sequence that may be the signal for addition of the 
5 poly A tail to the 3" end of the coding sequence. All of 
these sequences are suitably inserted into mammalian 
expression vectors. 

Polypeptide transcription from vectors in mammalian 
host cells is controlled by promoters obtained from the 

10 genomes of viruses such, as polyoma virus, fowlpox virus (UK 
2,211,504 published 5 July 19895 i adenovirus (such as 
Adenovirus 2), bovine papilloma virus, avian sarcoma virus, 
cytomegalovirus, a retrovirus, hepatitis -B virus and most 
preferably Simian virus 40 (SV40), from heterologous 

15 mammalian promoters, e.g. the act in promoter or an 
immunoglobulin promoter, from heat-shock promoters, and 
from the promoter normally associated with the polypeptide 
of interest, provided such promoters are compatible with 
the host cell systems. 

20 The early and late promoters cf the SV40 virus are 

conveniently obtained as an SV40 restriction fragment that 
also contains the SV40 viral origin of replication. Fiers 
et al., Nature . 221:113 (1978); Mulligan and Berg, Sc i ence * 
209 : 1422-1427 (1980); Pavlakis ec al . , ProC , Natl. Acad. 

25 sci. USA . 7£: 7393-7402 (1981). The immediate early 
promoter of the human cytomegalovirus is conveniently 
obtained as a HinclII E restriction fragment. Greenaway et 
al., Gene . I£: 355-360 (1982). A system for expressing DNA 
in mammalian hosts using the bovine papilloma virus as a 

30 vector is disclosed in U.S. 4,419,446. A modification of 
this system is described in U.S. 4,601,978. See also Gray 
et al., N&mx£, 211: 503-508 (1982) on expressing cDNA 
encoding immune interferon in monkey cells; y Reyes et al., 
Mature . 297 : 598-601 (1982) on expression of human 

35 -interferon cDNA in mouse cells under the control of a 
thymidine kinase promoter from herpes simplex virus, 
Canaani and Berg, Prnr Marl Aran. Sci. USA. 22: 5166-5170 
(1982) on expression of the human interferon 1 gene in 
cultured mouse and rabbit cells, and Gorman et al. t PrPC . 
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T7 nn USA . 71: 6777- 6781 (1982) on expression 

of bacterial CAT sequences in CV-1 monkey kidney cells, 

HeLa cells, and mouse NIH-3T3 cells using the Rous sarcoma 
5 virus long terminal repeat as a promoter. 
Enhancer Element Component 

Transcription of a dna encoding the polypeptide of 
interest of this invention by higher eukaryotes is often 
increased by inserting an enhancer sequence into the 
•10 vector. Enhancers are cis-acting elements of DNA, usually 
about from 10-300 bp, that act on a promoter to increase 
its transcription. Enhancers are relatively orientation 
and position independent having been found 5 1 (Laimins et 
al.. r -~ ^-i nsA. 21: 993 [1981]) and 3' 

15 (Lusky ec al.. PM Ml P>o„ 2: 1108 [1983]) to the 
transcription unit, within an intron (Banerji ec al.. Cell. 
22.' 729 11983]) as well as within the coding sequence 
itself (Osborne ec al.. Mm C*U L'- 1293 [1984]). 

Many enhancer sequences are now known from mammalian genes 
20 (globin. elastase, albumin, a-fetoprotein and insulin). 
Typically, however, one will use an enhancer from a 
eukaryotic cell virus. Examples include the SV40 enhancer 
on the late side of the replication origin (bp 100-270). 
the cytomegalovirus early promoter enhancer, the polyoma 
25 enhancer on the late side of the replication origin, and 
adenovirus enhancers. See also Yaniv, mmiSL. 231: 17-18 
(1982) on enhancing elements for activation of eukaryotic 
promoters. The enhancer may be spliced into the vector at 
a position 5' or 3' to the desired DNA, but is preferably 
30 located at a site 5' from the promoter. 
Transcription Termination Component 

Expression vectors used in eukaryotic host cells 
(insect, plant, animal, human, or mammalian cells) will 
also contain sequences necessary for the termination of 
35 transcription and for stabilizing the mRNA. Such sequences 
are commonly available from the 5' and, occasionally 3' 
"untranslated regions of eukaryotic or viral DNAs or cCNAs. 
These regions contain nucleotide segments transcribed as 
polyadenylated fragments in the untranslated portion of the 
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mRNA encoding the prohormone convertase. The 3' 
untranslated regions also include transcription termination 
sites. 

Construction of suitable vectors containing one or 
5 more of the above listed components and including the 
desired coding and control sequences employs standard 
ligation techniques. Isolated plasmids or nucleic acid 
fragments are cleaved, tailored, and religated in the form 
desired to generate the plasmids required. 

Particularly useful in the practice of this invention 
are expression vectors that provide for the transient 
• expression in mammalian cells of DNA encoding the 
polypeptide of interest. In general, transient expression 
involves the use of an expression vector that is able to 
L5 replicate efficiently in a host cell, such that the host 
cell accumulates many copies of the expression vector and. 
in turn, synthesizes high levels cf a desired polypeptide 
encoded by the expression vector. Transient expression 
systems, comprising a suitable expression vector and a host 
20 cell, allow for the convenient positive identification of 
polypeptides encoded by . cloned DNAs, as well as for the 
rapid screening of such polypeptides for desired biological 
or physiological properties. Thus, transient expression 
systems are particularly useful in the invention for 
25 purposes of identifying polypeptide analogs and variants 
that have desired activity. 

Other methods, vectors, and host cells suitable for 
adaptation to the synthesis of the prohormone convertase 
and other polypeptides of this invention in recombinant 
30 mammalian cell culture are described in Gething et al., 
Mature . 222: 620-625 [1981] ; Mantel ec al.. nature, 211: 
40-46 [1979]; Leyinson ec al.; EP 117.060; and EP 117.058. 
A particularly useful plasmid for mammalian cell culture 
expression of the prohormone convercase subunit is pRK5 (EP 
35 pub. no. 307.247) or pSVl6B. 

Selection and Transformation of Host Cells 

Suitable host cells for cloning or expressing the 
vectors herein are the higher eukaryote cells described 
above. Suitable host cells for the expression of 
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glycosylated polypeptide are derived from multicellular 
organisms. Such hose cells are capable of complex 
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higher eukaryotic cell culture is workable, whether iron, 
vertebrate or invertebrate culture. Examples of 
invertebrate cells include plant and insect cells. 
However, interest has been greatest in vertebrate cells> 
and propagation of vertebrate cells in culture (tissue 
culture) has become a routine procedure in recent years 
r »<.«„. mimre . Academic Press, Kruse and Patterson, 
editors (1973)]. Examples of useful mammalian host cell 
lines are monkey kidney CVl line transformed by SV40 (COS- 
7, ATCC CRL 1651); human embryonic kidney line (293 or 293 
cells subcloned for growth in suspension culture, Graham et 
al., t virol. . 2£: 59 [1977]); baby hamster kidney 

cells (BHK, ATCC CCL 10); Chinese hamster ovary cells/-DHFR 
(CHO, urlaub and" Chasin. prpr Natl , ■ USA , 22: 

4216 [1980]); mouse Sertoli cells (TM4 ,. Mather, £ioJ_ 
Kpnrod. . 21: 243-251 [1980]); monkey kidney cells (CVl ATCC 
CCL 70); African green monkey kidney cells (VERO-76, ATCC 
CRL- 1587); human cervical carcinoma cells (hela, ATCC CCL 
2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat 
liver cells (BRL 3A. ATCC CRL 1442); human lung cells 
(W138, ATCC CCL 75); human liver cells (Hep G2. HB 8065); 
mouse mammary tumor (MMT 060562 , ATCC CCL51); TRI cells 
(Mather ec al.. Prn»H NY Scl. ifll: 44-68 [1982]); 

MRC 5 cells; FS4 cells; and a human hepatoma cell line (Hep 
G2). Preferred host cells are human embryonic kidney 293 
and Chinese hamster ovary cells. 

Host cells are transfected and preferably transformed 
with the above-described expression or cloning vectors of 
• this invention and cultured in conventional nutrient media 
modified as appropriate for inducing promoters, selecting 
• transformants. or amplifying the genes encoding the -desired 

35 sequences. , . 

It is further envisioned that the cells comprising 
nucleic acid encoding the polypeptide of interest of -this 
invention may be produced by homologous recombination, or 
with recombinant production methods utilizing control 
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elements introduced into cells already containing nucleic 
acid encoding the polypeptide of interest currently in use 
in the field. For example, a powerful promoter /enhancer 
element, a suppressor, or an exogenous transcription 
modulatory element is inserted in the genome of the 
intended host cell in proximity and orientation sufficient 
to influence the transcription of nucleic acid encoding a 
desired prohormone convertase. The control element does 
not encode the prohormone convertase of this invention, but 
I affects the prohormone convertase nucleic acid which is 
present in the host cell genome. One next screens for 
cells making the prohormone convertase of this invention, 
or increased or decreased levels of expression, as desired. 
Mammalian cells may be stably transformed using any 
5 acceptable vector known to transform a particular cell 
type. A preferred vector is the P RK5 used in the present 
invention with 293 human kidney cell. For example, the 
mammalian cells are transformed to produce a prohormone 
convertase and these cells are then used to produce 
0 properly processed hormone by transforming them to express 
a prohormone. Among the preferred prohormones are those 
that when processed result in hormones containing two or 
more polypeptide chains. Preferred two chain hormones are 
relaxin and insulin. 
25 Culturing the host cells 

Prokaryotic cells used to produce the polypeptide of 
this invention are cultured in suitable media as described 
generally in Sambrook ez al.. supra. 

The mammalian host cells used to produce the 
30 polypeptides of this invention may be cultured in a variety 
of media. Commercially available media such as Ham's F10 
(Sigma), Minimal Essential Medium ([MEM]. Sigma). RPMI-1640 
(Sigma), and Dulbecco-s Modified Eagle's Medium { [DMEM] , 
•Sigma) are suitable for culturing the host cells. In 
35 addition, any of the media described in Ham and Wallace. 
M «rh. Enz . . 5£: 44 (1979), Barnes and Sato, ftn . rO PlOChem .. 
Ifll, 255 (1980). U.S. 4,767.704; 4.657.866; 4.927.762; or 
4.560.655; WO 90/03430; WO 87/00195; U.S. Pat. Re. 30.985; 
or U.S. 5.122.469. issued 16-Jun-1992 may be used as 
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culture media for the host cells. Any of these media may 
be supplemented as necessary w ith hormones an d/or ^ ot her 

growth factor) , salts (such *s sodium chloride, calcium, 
magnesium, and phosphate), buffers (such as HEPES), 
nucleosides (such as adenosine and thymidine), antibiotics 
(such as Gentamycin™ drug) . trace elements (defined as 
inorganic compounds usually present at final concentrations 
in the micromolar range), and glucose or an equivalent 
energy source. Any other necessary supplements may also be 
included at appropriate concentrations that would be known 
to those skilled in the art. The culture conditions, such 
as temperature. pH. and the like, are those previously used 
with the host cell selected for expression, and will be 
apparent to the ordinarily skilled artisan. 

The foregoing written specification is considered to be 
sufficient to enable one skilled in the art to practice the 
invention. Various modifications of the invention in 
addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing 
description and fall within the scope of ^ appended 
claims. The following examples are intended to illustrate 
the best mode now known for practicing the invention, but 
the invention is not to be considered limited to these 
examples . 

EXAMPLE 1 

Cloning of Murine Proh ormone Converses Type 1 
(mPCl) and Type 2 imPC^J 
The mouse pituitary tumor cell line. AtT20, was used 
as the source for candidate prohormone convertase mRNA's. 
The A tT-20 cell line was obtained from the American Type 
Culture collection (Rockville. md) . Cells were grown in 
Dulbecco's modified Eagle's medium (DMEM) containing 10% 
fetal calf serum. Total RNA was prepared from confluent 
cells (Maniatis 1983) and cDNA was generated by incubation 
of 5ug total RNA with 400 0 Moloney murine leukemia* virus 
reverse transcriptase (GIBCO-BRL) . 2.5 mg GeneAmp 10 X. 
reaction buffer (Perkin Elmer-Cetus) . 20 u RHasm 
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ribonuclease inhibitor (Promega) . 400mM dNTPs, 0.5 mg oligo- 
<JT (United Scaces Biochemical). 1.4 mM MgCl 2 . 10 km DTT; all 
in a total volume of 25 uL for 1 hour ac 37 degrees C. 

Amplification using a degenerate polymerase chain 

5 reaction (PCR) of candidate prohormone convertase targets 
was carried out in the same reaction tube by adding 50 pmol 
each of forward and reverse MOPAC PCR primers. 0.02 mmol 
dNTPs, with an additional 7.5 mL 10X reaction buffer plus 5 
U Tag DNA polymerase. The primer sequences were based on 

10 the conserved aspartate and serine catalytic residues of 
KEX2, PC2, Proprotease B. and subtilisin BPN as described in 
Smeekens. et al < ThP journal of Biolmiral rh^sr . rY . V ol 
265. pp. 2997-3000 (1990]). The forward primer used was: 
5 ' GCAAAATCTAGA (C/T ) (G/T) GCIAT (C/T) GTIGA (C/T) GA (G/T)GGI3 ' 

15 (Seq. ID #5) and the reverse primer was: 
5'AAGCATGAGCTCIGG(A/G)GC(A/G)GC(A/G)GCIGAICC3' (Seq. ID #6) . 
Thirty cycles of degenerate PCR were done, each consisting 
of denaturation (1 rain, 94 degrees C) . primer annealing (2 
min. 48 degrees C for the first 5 cycles; 2 rain at 55 

20 degrees C thereafter), and primer extension (3 min, 72 
degrees C). The products from the PCR reaction were 
electrophoresed in a 2.5% NuSieve-GTG low melting agarose 
gel (FMC) . Discrete bands of approximately 600 bp were 
excised and subcloned into pUC118 for DNA sequence analysis. 

25 Partial mPCl and mPC2 sequences were derived by this 
procedure . 

A cDNA pool was prepared as a source of template for 
the cloning of full length sequences of the candidate PC 
enzymes (Frohman 1988; innis 1990). Mouse AtT-20 total RNA 

30 (5mg) was heated for 3 min at 65 'C and immediately chilled 
on ice. The denatured RNA was added together with 200 U 
reverse transcriptase to a first strand cDNA synthesis 
reaction containing 4 mL 5 X H-RT buffer (GIBCO-BRL) . 10 mM 
DTT, 1 mM dNTPs, 20 U RNasin. and 2.5 pmol R 0 Ri-d(T)i7 

35 adapter primer: 5' gatatcactcagatcgatgaattcgagctc (t) 17 3- 
(Seq. ID #7) all in a final volume of 20uL. The mixture was 
incubated for 1 hour at 37 degrees C and diluted to 1 mL 
with Tris/EDTA. Aliquots (5-10 uL) of the cDNA pool were 
used for amplifications of unknown 3* and 5' regions by 
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^CE-fCR (rapid amplification of tf» ends. »""-« > «^ 
In nis 1890) using a combination of gene-s pecific PCR primers 

5 ' AAGCTTTCTAGAGGATCCCTCTGGTGGATTTGG3 ' (Seq ID #8) and 5'RACE 
primer 5 ■ AAGCTTGAATTCTCCAACCCCACACTTGTG3 • (Seq • ^ «»™ 
the inner adapter Ri PCR primer 5' AGATCGATGAATTCGAGCTC 
3 .(Seq. ID No. 10). Final construction of the full length 
. mPCl and mPC2 cDNAs was though the technique of recombinant 
PCR (innis 1990) . All PCR cloning reactions were earned 
10 out wich vent DNA polymerase (New England Biolabs) to 
minimize the introduced of mismatched bases during 
amplification of template. 

All mPC clones were sequenced with the Sequenase 2.0 
kit from USB and cloned into PUC118 (Vieira. et al Mfithfiiia 
15 fi nyyTnoloov 153 [1987]). 

EXAMPLE 2 

Kelaxin Production in Transfected Cells 



Construction of relaxin 



and prohormone convertase 



20 expression vectors 

The CDNA clone of the preprorelaxin gene, pCIS.Rx, 
provided the coding sequence of the human preprorelaxin. 
The pCIS mammalian expression vector is described in 
(coJan m *~* ™ »«* 2:3-10. 1990,. The cDNA encoding 
25 H2 preprorelaxin was excised from the cloning vector as 
described in Hudson. P ec al.. Th* FMPO Journal 3, 2333- 
2339 11984) by using a complete Hpa II digest followed by a 
Hinf'l partial digest. pCIS.Rx was digested by restriction 
endonucleases XBA I and ECO RI to excise the coding region 

30 of preprorelaxin. PRK7 was digested by 

endonucleases XBA I and ECO RI. The resultant XBA X/ECO RI 
human preprorelaxin coding region was ligated into the XBA 
l/ECO Ri digested pRK7 yielding the final preprorelaxin 
mammalian expression vector, pRK.Rx. 

35 The CDNA clone of the mPCl gene, PUCllS.mPCl from 

example 1, provided the coding sequence of the mPCl for 
construction of plasmids to direct the expression of mPCl 
in transfected mammalian cells. PUCllS.mPCl is digested 
with restriction endonucleases SAC I and ECO RI. pRK7 is 
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digested with restriction endonucleases SAC I and ECO Rl. 
The resultant SAC I/ECO Rl m?Cl coding region is ligated 
into the SAC I/ECO Rl digesced pRK7 to yield the final mPCl 
mammalian expression vector. pRK.mPCl. 
5 The cDNA clone of the mPC2 gene. P'JC118.mPC2 from 

example 1. provided the coding sequence of the mPC2 for 
construction of plasmids to direct the expression of mPC2 
in transfected manunalian cells. PUC118.mPC2 is digested 
with restriction endonucleases XBA I and ECO Rl. pRK7 is 
10 digested with restriction endonuciease XBA I and ECO Rl. 
The resultant XBA I/ECO Rl mPC2 coding region is ligated 
into the XBA I/ECO Rl digested pRK.7 to yield the final mPC2 
mammalian expression vector. pRK.nPC2. 

The cDNA clone of the KEX 2 gene, P YEp24.pJ28 (Julius. 
15 et al. Cell vol 37, p 1075 [1984] ) .provided the coding 
sequence of the KEX2 gene for construction of the final 
KEX2 mammalian expression plasmid. Vector P YEp24.pJ28 was 
digested with restriction endonuciease Eco Rl and run on an 
Agarose gel. A 3.3 kb fragment containing the coding 
20 region was isolated. pRK5 was digested with restriction 
endonuciease Eco Rl. and then treated with bacterial 
alkaline phosphatase (BAP> . The Eco Rl digested. BAP 
treated pRK5 was run on an agarose gei and a 4.7 kb 
fragment was isolated. 
25 The 3.3 kb fragment of P YEp24.pJ28 was ligated to the 

4.7kb fragment of pRK5 to form vector pRK.KEX.RI. 
pRK.KEX.RI was digested with restriction nuclease Dde I. 
then treated with Klenow. and further digested with 
restriction endonuciease Hind III. The Dde I/Hind III 
30. treated pRK.KEX.RI was run on a polyacrylamide gel and a 
425bp fragment was isolated. pRK5 was digested with 
restriction endonuciease Sma I/Hind III. then treated with 
BAP. run on an agarose gel and a 4.7kb fragment was 
isolated. The 4.7kb fragment of pRK5 was ligated to the 
35 425 bp fragment of pRK.KEX.RI to form vector pRK5.5'KEX. 
P RK5.5'KEX was digested with Hind ill and treated with BAP. 
A 2 kb Hind III fragment encoding the 3' end of KEX2 was 
• derived from vector P YEp24.pJ28 and was ligated to the Hind 
III/BAP treated vector pRK5.5'.KEX forming the final KEX 
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293 cells. 

Human Kidney 293 cells were .subjected to transient 
transfection by the method of Gorman (Gorman PNft Prof, En* 
l£ClL_2:3-10, 1990). Expression vector contaxning 
prbrelaxin cDNA. pRK.Rx, (10ug)' was transfected into 293 
cells alone (100 mm dish), or together with an equal amount 
of expression vectors for mPCl (pRK.mPCl), mPC2 ( P RK.mPC2) 
or kex2 (pRK.kex2) cDNAs. 

Stable transfection of relaxin in Human Kidney 293 

Cell8 G eneral aspects of mammalian cell host system 
transformations have been described by Axel in US 
4 399,216 issued 16 August 1983. The preferred method 

for stable transformation of a mammalian cell of the present 
invention is the calcium phosphate precipitation method 
described in sections 16.30-16.37 of Sambrook et al, supra. 
The preferred host cell is the human embryonic kidney line 
(293 ) or Chinese hamster ovary cells/ -DHFR(CHO) . The 
preferred media contains the antibiotic neomycin or 
geneticin. 

Characterization of Processed Relaxin 
laununoaffinity purification 

Following transfection, culture medium was removed and 
replaced by 2mL DMEM (minus cysteine and methionine} 
containing 200uCi each of 35S-cysteine and 3 5S -methionine 
(Amersham). After a 3 hr" incubation, the supernatant was 
collected and cells were washed with PBS. Cells were lysed 
with 1 mL lysis buffer (150 mM «cl. 1% NP-40, 50ml, Tns- 
HC1 PH 8.0). Both supernatant and whole-cell lysate were 
analyzed by immunoprecipitation. •* 

Supernatants (500uL) and whole-cell lysates (250uL) 
from the transfections were mixed with 5 mg of 
monoclonal antibody (MAbRlx6) for an overnight incubation, 
with gentle rocking at 4 degrees C. Antibody-antigen 
complexes were absorbed onto protein A-Sepharose CL4B beads 
(Pharmacia-LKB) at 4 degrees C for 1 hour. The beads, were 
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briefly centrifuged and gently washed three times with 
lysis buffer. Liquid was aspirated off and 50 mL 2 X 
tricine/SDS sample buffer (25% glycerol, 8% SDS,~1.5% Serva 
Blue G, 0.9 M Tis*HCl pH 8.45) was added to the beads. 
5 Samples were heated for 10 min at 85 degrees C and lOuL 
aliquots were applied to pre-cast tricine/SDS 10-20% 
polyacrylamide gradient gels (Novex) . Following 
electrophoresis, gels were fixed, soaked in Ampligy 
(Amersham), dried and exposed to X-ray film at -80 degree 
10 C. 

Following transfection of 293 cells with the 
expression plasmid for prorelaxin, pRK.Rx, only the 
precursor prorelaxin immunoprecipitated from cell lysates 
or cell supernatants. The prorelaxin bands at about 6.0 KD 
15 on a 10-20% polyacrylamide gradient gel. 

Co-transfection of 293 cells with the expression 
plasmid for prorelaxin, pRK.Rx, and pRK.mPC2, which directs 
expression of the mouse mFC2 cDNA, yields only the 
precursor prorelaxin, indicating that mPC2 is not involved 
20 in processing native prorelaxin in this system. 

Co-expression of prorelaxin and mPCl cDNA's or 
prorelaxin and KEX2 yielded mature, bonafide relaxin, which 
bands at about 3.0 KD on a 10-20% polyacrylamide gradient 
gel . 

25 Reverse phase HPLC and mass spectrometry analysis 

For preparative analysis of cell-secreted protein, 
three 10 cm dishes of confluent cells were transfected with 
substrate proH2Relaxin alone, or with a combination of 
proH2Relaxin (wild-type or mutant) and processing enzyme 
. 30 cDNAs . A total of 30 uL of supernatant was collected from 
each transfection and passed twice over a small (300uL) 
column of anti-relaxin monoclonal antibody linked to 
Sepharose-CL4B for immunoaf f inity purification. The column 
was washed with PBS and no-specif ically absorbed material 
35 was removed by a 1 M NaCl in PBS wash, followed by 2 M 
guanidine»HCl in 10 mM Tris.HCl pH 7.5. Antibody -bound 
protein was eluted with a small volume of "4 M 
guanidine-HCl/Tris •HCl • 
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The immunoaffinity column eluant was applied without 
furxh^Durificai^ 

of a linear gradient of acetonitrile in 0.1* 
5 Trifluoroacetic acid from 20% (0-5 min) to 60% (5-45 mm) . 
The flow rate was held at 0.5 mL/min and the absorbance at 
the column outlet was monitored at 214 nm. Eluants from 
. major peaks of absorbance were collected for analysis by 
mass spectrometry (Figure 8). 
10 The intact molecular weights of prorelaxin and 

processed relaxin were measured by electrospray ionization 
mass spectrometry. Micromolar solutions (l-10pmol/ul) of 
protein in aqueous acetonitrile (50/50. v/v) with 1% acetic 
acid were infused at 1.5 ul/min into a SCIEX API III mass 
15 spectrometer. The quadrupole mass spectrometer was 
operated with the lonspray articulated nebulizer (SCIEX, 
Thornhill, Canada) at 4600 V, the interface plate at 600 V. 
and the orifice at 100-120V. 

Data were collected every 0.1 v with the quadrupole 
20 scanning from 600-2200 u in 32 seconds. Molecular weights 
of the individual A and B chains were determined by fast 
atom bombardment (FAB) mass spectrometry following on-probe 
reduction as described by Stults et al. Wr m** mviron Mass 
SBgcircm 19:655-664 {1991}). 
25 Transaction of pRK.Rx alone resulted in the secretion 

of prorelaxin as the major product. This species had an 
HPLC elution time of 29 minutes. Co-transf ection of pRK.Rx 
' with either pRK.mPCl or P RK.KEX2 generated a major species 
that gave an HPLC retention time of 17 minutes. This 
30 retention time was very similar to that observed for 
authentic mature relaxin purified from human corpus luteum. 

Co-transfection major peaks eluting from the column 
were collected for analysis by mass spectrometry. 
Electrospray measurements generated mass values consistent 
35 with authentic mature relaxin: This was confirmed further 
by fast atom bombardment analysis of . reduced 5963 Da 
species. Two peaks of 2657 and 3313 mass units Were 
obtained, exactly corresponding to the predicted values for 
reduced relaxin A and B chains, respectively. Material 
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from the 5963 Da relaxin was subjected to several rounds of 
N-terminal peptide sequencing in a gas phase sequencer. No 
sequence was obtained for the A-chain as expected due to 
the pyroglutamyl residue that blocks its N-terminus. All 
5 amino acids detected were from the B chain and had a 
peptide sequence consistent with correct removal of the 24 
amino acid signal sequence that precedes the B chain. 
Additionally, the mature B-chain terminates with a serine 
residue at its C-terminus suggesting that carboxypeptidase 
10 activity present in the 293 cells removed the residual 
basic residues. KR, which remain following cleavage at the 
• B-chain / C-peptide junction. 

EXAMPLE 3 

Human Relaxin Mutants and PC Specificity 
15 prohormone convertase substrate specificity is 

exemplified by processing of human prorelaxin mutants. A 
series of mutant prorelaxin cDNAs were made in which basic 
residues thought to be required for processing were 
replaced with alanine. Two dibasic residues. KR. are 
20 located at the B-chain / C-peptide junction and both are 
required for processing, when either of these basic amino 
acids is replaced by alanine, unprocessed prorelaxin is 
secreted from the 293 cells. There is no indication of 
intermediate proteins indicative of a partially processed 
25 precursor, which might have resulted from partial cleavage 
at the C-peptide / A-chain junction. 
Site-directed mutagenesis of prorelaxin 

Site-directed mutagenesis was performed on the relaxin 
mammalian expression vector. pRK.Rx. by the method of 
30 Kunkel (Kunkel 1987) with some minor modifications. 
Transformation-competent <CaCl 2 > CJ236 strain of E. Coli 
was used as the bacterial host for the synthesis of uracil- 
coiitaining template phagemid DKA. and T 4 - enzyme. The 
oligonucleotide extension reaction was performed in a 10 uL 
35 volume for an initial period of 15 min at room temperature, 
followed by a 75 min incubation at 37 degrees C. The 
reaction was terminated by the addition of Tris/EDTA buffer 
to a final volume of 50 uL. A 10 uL aliquot was used to 
transform an ung* strain of £.. coJi. MM299, for selection 
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against uracil-containing wild-type template DMA. This 
strain was also used to produce single strand DNA for use 

(Sequenase kit, United States Biochemical) . 
Identification of prohormone convertase eubstrate 

specificity. 

Prorelaxin alanine mutations of the basic residues K 
and R were constructed to define murine prohormone 
convertase 1 specificity. Figure 11. The following 
nutations were tested for substrate specificity to murine 
prohormone convertase 1: 1.4 . 2.4. 3.2, 4.3. 7.2 and 8 6 
The relaxin (Rx) mutants have the following sequence in the 
prohormone convertase cleavage site. 



IS c-chain/A-chain junction mutants 



Native Rx 



20 RX 1.4 



Rx 2.4 



Rx 3.2 



Rx 4.3 



H • s r K K R Q (Seq. No. #48) 
CAT TCT CGA AAA AAG AGA CAA (Seq. NO. #H> 

H S V K K R Q ^eq. No.- #49) 
CAT TCT GTA AAA AAG AGA CAA (Seq. No. #12) 

H S R A K R Q (Seq. No. #50) 
CAT TCT AGA GCA AAG AGA CAA (Seq. NO. #13) 

H s R K A R Q (Seq. No. #51) 
CAT TCT AGA AAA GCA AGA CAA (Seq. No. #14) 

H S r K R A Q (Seq. No. #52) 
CAT TCT AGA AAA AGA GCA CAA (Seq. NO. #15) 



B-chain/C-chain junction mutants 



Native Rx 



Rx7.2 



T w S K R S L (Seq. No. #53) 
ACC TGG AGC AAA AGG TCT CTG (Seq. No. #16) 

T w S A R S L (Seq. No. #54D 
ACC TGG AGC GCT 'AGG TCT CTG (Seq. No. #17) 
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Rx 8.6 t W S K A S L (Seq. No. #55) 

ACC TGG AGC AAA GCT TCT CTG (Seq. No. #18) 

Figure 9 illustrates murine prohormone convertase 1 
5 processing of these mutants. Mutants 1.4 and 2.4 are 
processed the same as wild-type and mutant 3.2 is partially 
processed. Mutant 4.3 doesn't appear to be processed and 
mutants 7.2 and 8.6 are not processed. Disrupting the B-C 
chain junction at either the position mutated at mutants 

10 7.2 or 8.6 prevents any cleavage of prorelaxin. The OA 
chain is probably not available for cleavage by murine 
prohormone convertase 1 until the B-C chain junction is 
cleaved. This data supports a progressive cleavage 
mechanism involving cleavage of this upstream site first. 

15 The B-C chain site requires both basic residues. Mutants 
3.2 and 4.3 point out that while basic residues are 
required (as seen with 7.2 and 8.6) they are not 
sufficient. Both mutants 3.2 and 4.3 have intact KR sites 
and mutant 3.2 is not cleaved as well as wild- type and 

20 mutant 4.3 is not cleaved by murine prohormone convertase 
1. 

Murine prohormone convertase 2 does not cleave the 
wildtype prorelaxin. Murine prohormone convertase 2 does 
not cleave any of the prorelaxin mutants with the exception 

25 of 4.3 where there is some processing. This suggests that 
there is a difference in the specificity of cleavage for 
murine prohormone convertase 1 and murine prohormone 
convertase 2 which is defined by sequences other than the 
dibasic residues. Non-substrate proteins can be made to be 

-30 substrates by appropriate mutagenesis. Mutant 7.2 is not 
cleaved by murine prohormone convertase 2. Mutation of an 
upstream site in prorelaxin prevents downstream cleavage 
illustrating the presence of progressive cleavage of 
prorelaxin. 

35 EXAMPLE 4 

Construction of Proinsulin Mutants and Insulin 
Expression in Transfected 293 Cells 
The cDNA clone of the human preproinsulin gene, 
pSVEHIGDHFR, described in Australian patent 616,201 issued 
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February 18, 1992, provided the coding sequence of the human 
preproinsulin ge ne for Che final mammalian expression 

amplification of the coding region by RACE-PCR (rapid 
amplification of cDNA ends. Frohman 1988; Innis 1990) using 
a combination of gene-specific PCR primers: for the 5' end 
they were CAT AAG CTT ACC ATG GCC CTG TGG ATG CGC (Seq ID 
#18) (sequence given 5- to 3' Land for the 3- end they were 
CAT TCT AGA CTA GTT GCA GTA GTT CTC CAG (Seq. ID #19) 
(sequence given 5' to 3'). All PCR cloning reactions were 
carried out with Vent DNA polymerase (New England Biolabs) 
to minimize the introduction of mismatched bases during 
amplification of template. All proinsulin clones were 
sequenced with the Sequenase 2.0 kit from USB and cloned 
into restriction digested P RK5. The final preproinsulin 
mammalian vector is pRK. proins. 

A human proinsulin mutant having a non-naturally 
occurring prohormone convertase cleavage site is 
constructed by mutating the human proinsulin cDNA, 
•pRK.proins, encoding the naturally occurring basic cleavage 
site at the B-chain/C-peptide junction (KTRR) (Seq ID #20) 
and/or A-chain/C-peptide junction (LQKR) (Seq ID #21) by 
site-directed mutagenesis (Kunkel 1987). The following 
proinsulin variants were constructed: proins.RTKR.Ip (Seq 
ID #22). proins.RQKR.IIp (Seq ID #23). proins .KTKR.lp (Seq 
ID #24). The following double proinsulin variants were 
constructed: proins. KR. Ip/RQKR. Hp (Seq ID #23), and 
proins.RTKR.Ip (Seq. ID #22) /RQKR.IIp (Seq. ID #23). IP is 
the Type I enzyme cleavage site and Up is the Type II 
enzyme cleavage site. 

Primers used in proinsulin mutant construction were the 
following : 
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PROINS.RTKR.IP 



PROINS.RQKR.IIP 



PROINS. KTKR.lp 



CTCTGCCTCCCGCTTGGTCCTGGGTGTGTAG 

(Seq. ID #25) 
CACGCTTCTGCCGGGATCCCTC 

(Seq. ID #26) 

CTCTGCCTCCCGCTTGGTCTTCGGTGTGTAG 
(Seq. ID #27) 
56 
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The sequences are listed in 5'-->3' orientation. 

The double proinsulin mutant proins . KR. Ip/RQKR. Hp 
(Seq. ID #23) changes the native human proinsulin residue 
Arg 31 to Lys 31 at the Type I enzyme cleavage site, and 
proinsulin residue Leu 62 is changed to Arg 62 ac the Type 
II enzyme cleavage site. 

The double proinsulin mutant proins RTKR.lp (Seq. ID 
t22)/RQKR.IIp (Seq. ID #23) changes the native human 
proinsulin residue Lys 29 to Arg 29 at the Type I enzyme 
cleavage site, residue Arg 31 to Lys 31 at the Type I 
enzyme cleavage site and proinsulin residue Leu 62 is 
changed to Arg 62 at the Type II enzyme cleavage site. 

All mutants were screened through the primer regions 
15 with the Sequenase 2.0 kit from USB. 

DNA from ail proinsulin mutants and naturally 
occurring proinsulin was CsCl banded and used for 293 cell 
transient transfection by the method of Gorman (Gorman EHA 
Pr nr P.na Tech 2:3-10, 1990). The next day, cells were 
20 labeled with ass Met and :-f.S Cys (200uCi/ml) and labeled for 
4-6 hours. 

Immunoaf f inity purification 

Supernatants (500uL) or whole-cell lysates (500uL) 
from the transf ections were mixed with 4 ul of Cone. Guinea 

25 Pig Anti-Human Insulin Lot 101, from 3iomeda for an 
overnight incubation, with gentle rocking at 4 degrees C. 
Protein A Sepharose (CL4B- Pharmacia 17-0963-03) was 
prepared by washing 5 mis of Protein A Sepharose with NP40 
lysis buffer. The wash procedure was repeated 4 times. 

30 After the last wash was aspirated, enough buffer was added 
to provide a 25% v/v slurry of Protein A Sepharose-Cl4B. 
100 ul 25% protein A slurry was added to each overnight 
sample containing the Cone. Guinea Pig Ant i -Human Insulin 
Lot 101. from Biomeda. Incubate 1 hour at 4 degrees with 

35 rocking. Centrifuge 15 seconds, aspirate supernatant into 
waste, and wash 3 times with NP 40 lysis buffer. Aspirate 
buffer. Add 30 ul 2x Tricine-SDS sample buffer to- each 
sample tube. Heat 100 degrees for 10 minutes and apply 
lOuL aliquots to pre-cast cricine/SDS 16% polyacrylamide 
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gradient gels (Novex) . Following electrophoresis, gels 
were fixed, soaked in Ampligy (Amersham) , dried and exposed 
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Gel results . 

Transfections using native human proinsulin DNA 
resulted in a prominent protein band at 6.5 KD and a faint 
band at 3.4 KD. Transfections using the DNA ot 
proinsulin variant proins.RTKR.lp (Seq. ID #22) /RQKR.IIP 

(Seq. ID #23), resulted in protein bands at 3.4KD and 2.35 
KD where authentic insulin B and A chains, respectively, 
run on 16% tricine/SDS polyacrylamide gradient gels 

(NOVEX) • 

Transfections using the DNA of proinsulin mutants, 
proins.RQKR.Hp (Seq. ID 123) or • double mutant 
proins.KR.Ip/RQKR.HP (Seq. ID #23), resulted in two bands 
one band at about 6.0 KD representing the B-chain/C-peptide 
intermediate band and one band at 2.35 KD representing the 

insulin A-chain. 

Transfections using the DNA of proinsulin variant. 
proins.RTKR.lp (Seq. ID #22). resulted in two bands, one 
band running slightly below the B-chain/C- P eptide 
intermediate band representing the c-peptide/A-chain 
intermediate band and one band at 3.4 KD representing the 

insulin B-chain. > 

Transfections using the DNA of proinsulin mutant 
proins.KTKR.I P (Seq. ID 124) resulted in one prominent band 
at 6.5 KD and a faint band at 3.4 KD. 

EXAMPLE 5 

Construction of Prohormone Convertase Mutants 
Prohormone convertase precursor residues involved in 
pressing of eh. precursor form of the W-on. 
convertase to the bioactive form of the prohormone 
convertase are mutated such that the prohormone convertase 
precursor is processed by the prohormone convertase 
naturally occurring in the human kidney 293 cell 

Site-directed mutagenesis of the mammalian expression 
vector. pRK.mPCl or mammalian expression vector, pRK.mPC2 
is by the method of Kunkel (Kunkel 1987). The following 
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primers were used in che prohormone Convercase precursor 
mutant construction: 

mPCl primer GAT ATG AAG AGC AGA TCT TTT GGA CC7 CCG AGG ATG 
5 (Seg. ID #28) 

mPC2 primer CTT ATG GTG TAA GCT TCG TTT TGC TCT GGC CTT TGC 

AAG (Seq. ID #29) 

10 where primers are given in a 5' to 3 ' direction. 

The PCI primer results in a mammalian prohormone 
convertase 1 mutant having the native residue ARG.80 
mutated to LYS.80. 

The PC2 primer results in a prohormone convertase 
15 mutant having the native mammalian prohormone convertase 2 
residues LYS.77 ARG.78 ARG.79 mutated to ARG.77 ALA. 78 
LYS.79 

All mutants were screened through the primer regions 
with the Sequenase 2.0 kit from USB. 
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EXAMPLE 6 

Processing of NGF, BNDF and NT-3 Mutants by mPC 1 

and 2 

To elucidate the role of the precursor in the 
processing of the neurotrophic factors, chimeric pRK5 
expression vectors were constructed to contain the 
precursor of NGF, NT-3, or BDNF linked to the mature 
portion of NGF or the NT-3 precursor linked to the mature 
portion of BDNF. The entire PCR amplified region of each 
chimera containing the ' NGF mature sequence was found to be 
identical to the published sequences for each precursor 
portion and for the mature NGF with the exception of a 
silent point mutation in codon #403 in pRK5 . BDNFNGF . 
Plasmid v vector pRK5 was used in all constructions. 
Nerve Growth Factor (NGF) , Neurotrophic Factor 3 
(NT-3), or Brain Derived Neurotrophic Factor (BDNF) 

Nerve Growth Factor (NG?) . Neurotrophic Factor 3. (NT- 
3), or Brain Derived Neurotrophic Factor (BDNF) cDNA were 
inserted into the cloning linker of pRK5 producing 
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pRKS.KGF. PRK5.BDNF. and pRKS.NT-3. "\ 

transformed into E. coli 29« e.ll. " 

mM KH2P04) • The plasmid DNA was isolated by alkaline lysis 
and purified via cesium chloride double banding according 
to the procedures in Sambrook et al. (1989). 
pRK.NMNGP plaemid construction 

DNA encoding prepro NT-3 was spliced to the DNA 
encoding the mature portion of NGF using a polymerase chain 
reaction (PGR) (Saiki et al.. 1985, Kullis et al. 1986) 
0.5 ug of both pRK5 . NT 3 and pRK5 .NGF were linearized with 
Hind III. The precursor portion of NT 3 was amplified 
using primers a and b while the mature portion of NGF was 
amplified using the c and d primers as shown in Table 1. 

PRIMER SEQUENCES'-;^' 

a TACAACTCACCGCGGGTCCTG (Seq ID #30) 

1247-1267 NT3 /S 
b ' AAGATGGGATGGGATGATGACCGTTTCCGCCTTGATGT (Seq ID 

#31) 

1349-1366 NT3 /AS 
1407-1426 NGF /AS 

' 0 ACATCACGGCGGAAACGGTCATCATCCCATCCCATCTT (Seq ID 
#32) 

1349-1366 NT3 /S 
1407-1426 NGF /S 

d GATATAAGCTTGAGAGTGTAGAAGGGGC (Seq ID #33) 
1794-1810 NGF /AS 



40 S^ffl^^^ 

■AS" indicates antisense primers). 
. pcr was performed on a Ther M l cycler .60 . (PerWn-Elmer 
cecus. uTder the folding conditions: 100 «. reaction 
4S volumes. 50 P»ol primers, 200 p, dNIP's. 10 m b- 
mercaptoethanol. 16.6 m (NB4> 2 S04. 67 nM 8 ;°' 

6.-7 m mci2. »» ED ™' °- 15 m '"" 1 BSA - 
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reaction cycle consisted cf denatviracion at 94 degrees C 
for 1 rain, annealing at 55 degrees C for 1 min. and 
elongation at 72 degrees C for 3 rain. After 35 cycles, 
aliquots of PCR reactions 1 and 2 were purified on an 8% 

5 acrylamide gel. Bands of 119 base pairs for the NT3 
precursor and 414 base pairs for NGF mature were cut out 
and the DNA was recovered via electroelution as previously 
described {Maniatis et al. 1982). 

Purified products from reactions 1 and 2 were spliced 

10 together in PCR reaction 3 using the a and d primers under 
the same amplification conditions. (Both a and d primers 
contain a restriction site that will be used for cloning 
back into the pRK vector.) 

Final PCR reaction products were phenol /chloroform 

IS extracted, ethanol precipitated, and digested to completion 
with Ssc II and Hind III. Digestion products were 
electrophoresed on a 1% NuSieve gel. The 533 base pair 
fragment was cut out of the gel and used in the ligation 
reactions described below to generate the pRK5 .NT3NGF 

20 'fusion plasmid. 

Plasmid pRK5.NT3 was digested to completion with 
tfindlll and Eco RI in one reaction and Eco RI and Ssc II in 
a second reaction. Reaction products were electrophoresed 
on a 1% NuSieve gel. The 4681 and 337 base pair bands were 
25 cut out of the gel, melted, and ligated together with the 
melted 533 base pair fusion product described above 
according to standard protocol. (Maniatis, 1989).. 
Ligation products were melted for 5 minutes at 65 degrees 
C, diluted with 100 Jil 10 mM MgCl2. and transformed into E. 
30 coli 299 cells. After picking colonies and screening them 
via restriction endonuclease digestion, single stranded DNA 
was prepared and sequenced through the entire PCR fusion 
area as previously stated. ■ -- ■ 

PRK5.BDNFNGF plasmid construction 
35 The BDNF precursor was fused to mature NGF in a 

similar manner as above with the exception chat the fusion 
area was created in just one PCR reaction. 0.5 mg of Hind 
III linearized pRK5.BDNF and pRK5 . NGF DNA was amplified in 
one reaction containing all four primers: a.b.c. and d 
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(Table 2). All other conditions of the amplification were 
Ustatea above. The PCR products were phenol/ chloroform 

with Bsc Bland Hind III." Digestion products were 
: ectrophoresed on a li NuSieve gel and the 650 base paxr 
fusion band was cut out and saved for the Ixgatxon ^ 

Primers used in pRK.5 . BDNFNGF construction are 

shown below: 

PRIMER SEQUENCE 5 ■ - _>3 ' LOCATION /STRAND 

; GGCTTGACATCATTGGCTGACACTTTCGAACACATGATAG (Seq ID 
#34) 

1332-1371 BDNF / S 
b AAGATGGGATGGGATGATGAGCGCCGGACCCTCATGGACAT 

(Seq ID #35) 
1533-1553 BDNF / AS 
1407-1426 NGF/ AS 
c ATGTCCATGAGGGTCCGGCGCTCATCATCCCATCCCATCTT 

(Seq ID S36) 
1533-1553 BDNF / S 
1407-1533 NGF / S 
25 d GATATAAGCTTGAGAGTGTAGAAGGGGC (Seq ID #37-) 

1794-1810 NGF / AS 

The location oi I^^^ST^'p^^ 

30 well as the strandedness ( ^ """^ 
■AS" indicates ant isense primers) . 

,U-U pRKS . BDNFNGF «as constructed by d«-t£ 
pRK5 - BDNF CO «*1.C1» with Hind »^'^; £ 

produces ware electrophoresea on a U NuSieve g 

35 U base pair .and - c - ^s w ^ 

melted 650 bp PCR fusion band. Ligation proauc 
"ansLed. screened, and sequenced thrown the PCR re 9 xon 

as stated above. . 
DNA for all of the above constructs was prepared via 

40 cesium chloride banding as stated earlier. 

PRK5.N-P3BDNF Plasndd Construction diaested to 

Plasndd pRKS.NT-3 and pRK5 . BDNF were digested to 
c 0mD letion with Esp I and Kpn I. Digestion products were 
r^ToZ:^ on P a » .Sieve gel. The 4810 and ^base 
45 pair fragments were cut out of the gel and Ugated 
together. The missing 126 base pair region contammg the 
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3- end of the NT-3 precursor and the 5" end of the BDNF 
mature was prepared using synthetic DNA oligonucleotides. 
Six overlapping primers (sense and anci -sense, Table 3) 
were ligated together in one reaction according to standard 
5 protocol. The ligation products were then digested with 
Esp I to generate the 126 base pair fusion fragment. 
Plasmid pRK5 . NTBD was digested to completion with Esp I and 
ligated to the 126 base pair fragment. Following 
screening, pRK5 . NT3 BDNF was sequenced throughout the entire 
10 126 basepair fusion area. 

Primers Used in NT3BDNF Synthetic Fusion Area are 

shown below. 

Primer Sequence 5'-->3' 



15 Is TGAGCGACAGCACCCCCTTGGAGCCCCCGCCCTTGTATCTCATGGAGGATT 
(Seq ID #38) 

la TCAGCTCCCCTCGTCGGGCGGGGTCCGAGTGCCGTTTCCGCCGTGATGTTC 
(Seq ID #39) 

2s ACGTGGGCAGCCCCGTGGTGGCGAACAGAACATCACGGCGGAAACGGC (Seq 

20 ID #40) 

2a TGTTCGCCACCACGGCGCTGCCCACGTAATCCTCCATGAGATACAAGG (Seq 

ID #41) 

3s ACTCGGACCCCGCCCGACGAGGGGAGC (Seq ID #42) 
3a GCGGGGGCTCCAAGGGGGTGCTGTCGC (Seq ID #43) 

25 

The "S" following the number of the plasmid indicates a 
sense strand plasmid and the -a" indicates an anti-sense 
strand. The primers were constructed to overlap each other 
when ligated together. 

30 Kunkel mutagenesis 

To control for variation in the 5' portion of each 
gene within the vector and to optimize conditions for 
translation of the neurotrophic mRNA's. an 'ACC* 
translation consensus sequence was added to the 5' ends of 

35 the cDNA inserts and extra base pairs of 5' sequence 



63 



WO 93/11247 

originally cloned into pRK5 . BDNF (including two ATG sites, 

2 I!" asmid "i was transformed into JJ1 
content E. coli strain CJ236. Single stranded uracil 
containing template P»A was prepared accords, to Sambrook 
« al (1989) the exception that after PEG 

recitation, the pellet was resumed in 100 ,* . 
extracted once with 50 (U. phenol/chlorofor» (1.1). »<» 
twice with 500 UL chloroform. After ethanol P««» l ~ 
trpeuet was Suspended in 45 p. dH 2 o and spun-dxalys d 
through a Sepharose CL-6B (Pharmacia / LKB) column. 100 
pmol synthetic oligonucleotide primers contains the 5 
Ice (Table «) were phosphorylated with T4 polynucleot.de 
Sa e. Mutagenesis was performed using the Huta-Gene 
Phagenid in vitro Kutagenesis kit (BioKad. according to 
LtLturer. instructions. The products were transfe r 
into E.coli 299 cells and resulting colomes were Packed 
and screened. Single stranded D»A was prepared for 
ana screeneo _ following 

sequencing (Sambrook et al.. X989I with 
edifications: precipitation of the phage^d particles was 

the pricing area with the Se,uenase Version 2.0 Seouencmg 
Kit (USB) according to manufacturer's instructions. 

T ab!e 4: Primers .antisense. used in *■ Kunkel Hucagenesis 

Primer Sequence 5'-->3 § 
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QTiU3AACAACAT(MACATGGTGGCAATATTGTCGACT£T5G^GTCGACCTGCAG 
(Seq ID #44) 

MT3 . 5 ' C ACATAAAACAAG ATGGAC ATGGTCTTGTTCACCTGTASJSAICC.CCGG 
(Seq ID #45) 

5^A^AAGGAAAAGGATGGTCATGGTGGAGGTCGA£MS£2XGAGAATTCAATCG 
(Seq ID #46) 
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The altered restriction sites used for screening purposes 
are underlined. 

Cell Culture--ELISA 

5 Plasmids were prepared for transfection via cesium 

chloride double banding.. Both the original plasmids and 
the 5' end mutagenized plasmids pRK5 . NGF , pRK5 .NT3NGF, and 
pRK5 . BDNFNGF were transfected along with an hGH control 
plasroid into the human kidney cell line (293) using the 
10 calcium phosphate precipitation method (Gorman et al. 
1990). 36 hours post-transf ectior. the supernatants were 
assayed for NGF and hGH (as a control) expression with the . 
enzyme linked immunosorbent assay (ELISA) . ELISA's were 
performed by Immunoassay Services. Genentech, Inc. 

15 supernatants of all construct transf ections were also 
assured to have bioactivity by their effects on dorsal root 
ganglia and sympathetic neurons. 

Cell culture — radioactive protein labelling 

Plasmids pRK5 .NGF , pRK5.NT-3. pRK5 . BDNF, pRK5 .NT3NGF, 
20 pRK5. BDNFNGF and pRK5 . NT3BDNF were transfected into the 
human kidney cell line (293). 24 hours post transfection. 
the cells were washed with PES and labelled for 12-14 hours 
with [35 S ] -methionine and [35 S ] -cysteine at 200 uCi/ml in 
cysteine/methionine minus DMEM media. Selected plates of 
25 cells were labelled with either l 35 S] -cysteine alone, or 
t 35 S ] -methionine alone in cysteine minus media, or 
methionine minus media, respectively. The radioactive 
supernatants were collected- and either concentrated 5-10 
fold in a Centricon-10 (Amicon) according to manufacturer's 
30 instructions or immunoprecipitated along with cell lysates 
' as stated below. One-fifth of each concentrated sample was 
loaded on a pre-cast 16% Tris-glycine. SDS-PAGE denaturing 
reducing mini-protein gel (1.5 mm) and electrophoresed 
according to manufacturer's directions. (Novex) . The gels 
35 were fixed with Amplify (Amersham) as suggested in the 
instructions. Fixed, dried gels were exposed to film at 
-70 degree C for 8 to 24 hours. 

Cell lysates were collected by washing the cells with 
PBS and lysing with triton lysis buffer. (1% triton. 5mM 
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HEPES pH 7.2). cell, debris was removed by centrif ligation. 
Lysa tes and supernatants were immunoprecipitated as 

Rabbit anti-NGF and goat anti-hGH antibodies were 
S provided by Immunoassay Services, Genentech, Inc. One half 
of each supernatant and lysate was immunoprecipitated with 
rabbit anti-NGF antibodies alone. The other half, serving 
as a transfection control, was co -immunoprecipitated with 
the rabbit anti-NGF and goat anti-hGH antibodies. 
10 [3 5 S ] -labelled supernatants and lysates were 

preincubated with 100 UL 10% Pansorbin cells 
( staphylococcus aureus) in phosphate buffered saline 
(Calbiochem) for 30 co 60 minutes at 4 degrees C with 
rotation. Cells were removed by centrif ugation and the 
15 supernatants were incubated as above with saturating levels 
of rabbit anti-NGF antibodies or rabbit anti-NGF and goat 
anti-hGH antibodies. 120 UL 10% Pansorbin cells were added 
to the mixture and incubated for 1 hour at 4 degrees C with 
rotation. Each tube was centrifuged for 2 minutes in a 
20 microcentrifuge. Supernatants were removed and the pellets 
were washed once with Buffer #1 (IK NaCl, 0.05 M Tris pH 
6.8 in water), twice with Buffer #2 (1M Tris pH 8.8, 0.2M 
NaCl , 1% NP40, 0.3% SDS). and once with cold dH 2 0. washed 
pellets were resuspended in . 40 uL 2X Tris-Glycine SDS 
25 loading buffer. 20 UL was loaded on a pre-cast 16% Tns- 
glycine SDS reducing mini-protein gel (1.5 mm) and 
electrophoresed according to manufacturer's 
directions. (Novex). The gels were fixed with Enhance 
(Dupont) as suggested with the exception that the final 
30 wash in water included 5% glycerol to avoid gel cracking. 

Fixed, dried gels were exposed to film at -70 degrees 
C for 9-66 hours. 

Effects of precursor on NGF expression' 

The effects of the 5' mutations on secreted NGF levels 
35 were determined by three hGH controlled experiments 
performed in duplicate. In these experiments, the hGH 
controlled for potential differences due to transfection 
efficiencies. For example, in any one experiment, NGF and 
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hGH expression varied with each plate. Since each NGF 
expression value was compared only to the hGH expression in 
the same plate, relative comparisons could be made. ■ 

The results of the 5' changed plasmids (pRKS.NGF 
5 5 'ACC, pRK5.NT3NGF5'ACC, and pRK5 . BDNFNGF 5'ACC) are 
summarized in Table 5. The chimeric proteins were compared 
to the wildtype NGF expression level which was set at 100 
%. Though there seems to be about a 2 -fold difference in 
the levels of NGF production from pRK5 . NGF and pRK5 .NT3NGF, 
10 the production levels from pRK5. BDNFNGF have dropped about 
50-fold. 

Table 5: Effects of Precursor on Expression. 
15 Plasmid NGF/ hGH Average I of NGF Expression 



pRK5.NGF5'ACC 

20 



25 

pRK5.NT3NGF5'ACC 

30 

pRK5 . BDNFNGF 5 ' ACC 

35 

- Since it- is in its normal form (wildtype), pRK5 . NGF 5 ' ACC 
40 "was given the value of 100 %. The two chimeras were then 
compared as a percentage of this value in the last column. 

There are similarities and differences in the amino 
acid sequences of NGF, NT-3 and EDNF. Most" apparently . the 
mature protein factors are quite homologous (> 50%) 
45 especially when compared to the precursor portions of each 
preproprotein . (- 20% homology). 

Some major differences are observed when comparing the 
three precursors. First, NGF and NT-3 contain the dibasic 
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« a ir Lvs-Arg at the primary processing site while BDNF 




sites whereas HT-3 has only one end BDBF does not l» • 
s cell culture with protea.ee Xe*2. -01. «« -« 

Plasmas pRK5 .BDNF and PRK5.W-3 were transacted as 
above with the exception that an additional 3 , ^ of 
session plasmid P RK.Kex2. p R K.mPCl. or pKK.»^ » P« 
plate U00»> was added to the transfection mixture Given 
i. the difference in the dibasic cleavage pattern . with each 
precursor, the effects of the enzymes. Kex2. mPCl and mPC2 
on HT-3 and BDNF were examined. 

Kex2 cleaved the upstream Arg-Arg site in HT-3 . but 
was unable to process , the BDHF upstream region because 

m there is no such sice. 

mPCl and. mPC2 have no additional effect on the 
processing of NT-3 or BDNF in the 293 cells. 

EXAMPLE 7 

Construction of proinsulin BiO ^ 
» Aep.rtic Acid mutant and insulin expression in 

transfected 293 cells. 

T he cDHA done of the human preproinsu in gene 

pSVEHIGDHFR. described in Australian patent 616 201 issued 

February 18. 1992. provided the coding sequence of the 
J5 numaTpreproinsulin gene for the final mammalian expression 

vector A second source of the human preproinsulin gene^ 
Sures et al. KiflK. 208:57-59 ,1980) , has also been 

used in experiments to provide the coding sequence of human 

proinstli^ Aliquots (5-10 MM of pSVEHIGDHFR were used 
* tor amplification of the coding region by RACE-PCR (rapid 
" ^ lotion of CD»A ends. Frohman „ M , ^ > ^ 

a combination of gene-specific PCR primers: for the 5 ena 

t Z V re CAT AAG CTT ACC ATG GCC CTG TGG ATG CGC (sequence 
SIT- to 3M.and for the end they were . CAT TC T « 

Suction of mismatchedbases during amplification £ 
template. All proinsulin clones were sequenced with the 
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Sequenase 2.0 kit from USB and cloned into restriction 
digested pRK5. The final preproinsuiin mammalian vector is 
pRK7.proins. 

A human proinsulin mutant having a non-naturally 

5 occurring prohormone convertase cleavage site is 
constructed by mutating the human proinsulin cDNA. 
pRK.proins. encoding the naturally occurring basic cleavage 
site at the B-chain/C-peptide junction (KTRR) and A- 
chain/C-peptide junction ( LQKR ) by site-directed 

10 mutagenesis (Kunkel Supra). The double proinsulin variant. 
proins.RTKR.lp/RQKR.Ilp. was constructed. Ip is the Type 
I enzyme cleavage site and Up is the Type II enzyme 
cleavage site. Primers used in construction of the 
proinsulin mutant. proins.RTKR.lp/RQKR.Ilp. are described 

15 in example 4. The double proinsulin mutant proins 
RTKR.Ip/RQKR.IIp changes the native human proinsulin 
residue Lys 29 to Arg 29 at the Type I enzyme cleavage 
site, residue Arg 31 to Lys 31 at the Type I enzyme 
cleavage site and proinsulin residue Leu 62 is changed to 

20 Arg 62 at the Type II enzyme cleavage site. Other 
proinsulin mutants, proins .RTKR. Ip. proins . RQKR .Hp, 
proins.KR.Ip/RQKR.IIp, and proins. KTKR.lp were constructed 
as in example 4. All mutants were screened through the 
primer regions with the Sequenase 2.0 kit from USB. 

25 Proinsulin, pRK7 . proins, and the proinsulin mutant. 

proins.RTKR.lp/RQKR.Ilp, were then mutagenized by the site- 
directed mutagenesis of Kunkel ec al. ( Wrhoflq F.navmo l. 
vol.154, pg. 367-382 [1987)) to yield pRK7 .proins. B10H>D, 
proinsulin having the histidine at position 10 in the B- 

30 chain replaced with an aspartic acid, and the proinsulin 
mutant. proins.RTKR.Ip/RQKR.IIp.B10H>D having the histidine 
'at position 10 in the B-chain replaced with an aspartic 
acid. The primer used in the Kunkel et al. Supra, method 
was GC-TTC-CAC-CAG-GTC-GGA-TCC-GCA-CAG-GTG. with the 

35 sequence given in a 5' to 3' direction. All mutants were 
screened through the primer regions with the Sequenase 2.0 
kit from USB. 

DNA from the proinsulin mutants, 
proins . RTKR . Ip/RQKR. UP, proins . RTKR . Ip/RQKR .Hp. B10H>D, 
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pR1 „.proins.B10H>D. and proinsulin. p^.prcins wasCsCl 
Ldea ana used for h,man embryonic kidney cell line^ 

rZTair— ».3-». »»•>• 

labelled as described by Gorman. Supra ana cells ana 
iysates ware analyzed for insulin-like proteins. Protein A 
sepharose immunoprecipit.tion was performed by the method 
of Harlow ana Lane (from BlMU ul , n W ,rorv mnua l. 
pub. COW spring Harbor Laboratory, chapter 11. PP^ 42 W 
° UU1 . on both supernatant and lysate samples ^ using 
oncentraced guinea pig anti-human ■ 
(Biomeda I Oil. immunoprecipitation products were 

, * ■ -h.^ jx Tris-Glvcine Sample Buffer (from 
resuspended in either 2X Tris Glyc ^ 

Hovexj/HM urea plus b-mercaptoethanol or 2X 
plus b-meroaptoethanol. heatea for 5 minutes at 100 C 
spun briefly, and run reduced on IS, 

16% Tricine gels, respectively, according to manufacturer s 
"actions ,-ovex,. The gels were fixed ,10% aceti^a . 
25 % isopropanol. 6S% water), and soaked « * *^ 
(Amersham, as suggested in the instructions Fixed, 
gels were exposed to film at -70 «c for 1 to 4 

m from the proinsulin mutants was used for transient 
transaction by the method of Gorman supra and twelve to 
^ours post transection, the -^"^ 
replaced with serum free media and secretion products were 
c llec ed for 36-56 hours. Processed insulin was measure 
in the collected supernatants by radioimmunoassay •») 
using the Equate® «A INSULIN. «. from Binax. »e. 
according to the provided instructions. 

TO detect the bioactivity and specific activity o 
' proinsulin or mutant proinsulin. secretion «™*>~J£ 
measured by quantitating the increase in tyrosine 
"sXylat- of the beta chainof the 
in 293 cells. 293 cells were plated in 24 well 
plates in F12-DMEH medium containing serum (10%) (50 000 
cells per well) and allowed to attach overnight Cells 
were transferred into medium without serum for 
serum free media was aspirated and cells were stimulated 
STl mL of various concentrations of wildtype bovine 
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insulin (Novo Induscri A/S) or mucanc insulin from 
conditioned supernatants. After a 10*15 minute incubation 
at 37°C, the reaction was stopped by the addition of 100 \il 
2X Tris-Glycine sample buffer (from Novex) plus b- 
5 mercaptoethanol (50 \il per mL c£ sample buffer). The 
samples were vortexed vigorously, heated for 5 minutes at 
100°C and 20&IL was electrophoresed on an 8% Tris-Glycine 
SDS gel (Novex), After electrophoresis, tyrosine 
phosphorylation of the b chain of the insulin receptor was 
10 detected and quantitated by immunoblotting with anti- 
phosphotyrosine antibodies followed by scanning 
densitometry as described by Holmes et al. fie i ence 

256:1205-1210 (1992J). 

NIH 3T3 HIR3.5 cells (Whitaker et al.PNftfi USA* 84: 

15 5237-5249 [1987]) overexpressing the human insulin receptor 
were incubated in serum free medium with various 
concentrations of unlabelled wildtype insulin or insulin 
mutant and a constant amount of ^-I- labelled Insulin 
(Amersham) for 16 hours at 4 °C. Unbound ligand was 

20 removed and cells were washed wich ice cold medium. The 
amount of radioactivity bound was determined after 
solubilizing the cells with 0.1 M NaOH containing SDS 
(0.1%). Relative binding of wildtype insulin and the 
insulin mutants was determined using a non-linear 

25 regression program. 

Determination of insulin receptor binding was 
determined by the following method: Human kidney cells 
(293) or 3T3 cells overexpressing the human insulin 
receptor, Whitaker et al . Supra were plated at about 

30 100,000 cell per well dishes. One to two days after 
plating, the cells were washed in PES and starved for 8-16 
hours in binding buffer (F12/DMEM (50/50) ;2mM Gin; 10 mM 
HEPES; 1 mg/rol BSA; penicillin and streptomycin (Gibco 600- 
5140 Ag 100 units/ml)- Cells were placed on ice and washed 

35 twice with cold binding buffer. 480^1 of non-radioactive 
insulin ligand (mutants or standards) in triplicate were 
placed in each well followed by 20pl (50,000-100,000 
cpra/well) 125 I -labelled insulin (at tyrosine-Al4 ) 
(Amersham). Cells were incubated at 4°C for 14-16 hours 
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>h oentle rocking. The media was removed and cells were 
" Id Wee wich cold binding buffer. The amounc of 
washed twice ^ en1ubilizi ng the 



cells with 0.1N NaOH containing SDS (0.1%). ' 
cells wic humgn xn8ulxn 

s curve was generated using u . t . .. c , 

5 curve * ct -andards {NOVO Industri A/S) . 

standards (OSP) or bovine standards two flpterminea 
w «f the insulin mutants was determined 

Relative binding of tne insuxj.* 

using a non-linear binding regression program. 

RESULTS 

io FTft Rfmlrs Tn 292 na/mL 
smpl. dil. CPM conc(uU) construct 

x 1:5 Ull GTS* PRKV.proins 

A 1:10 2111 1500-3000 PRK7.proins 



A 



l' :2 0 LOST 2750 PRK7.proins >63 

B 1:5 LOST P^.proins ™ 

8 1-10 2326 1500-3000 PRK7.proins >63 

3 1,50 3589 1500 PRK7.proins 63 

C 1:2 1472 GTS P RK7 .proins . 1" --- 

c 1:5 2770 675 P RK7 . proins . 1 27 

c 1:10 4464 600 pRK7.proins.l- 27 

I Zl LOST -- pRK7.proins.l ^ 

D i. 10 4215 650 P RK7.proms.l 27 

E l':10 LOST — ■ P*K7 .proins .2* — 

£ i-ioo 2311 >15,000 P RK7.proins.2 >625 

E l':500 LOST P*K7 .proins.2 — 



1:2 1405 GTS P RK7.proins.l 



E 1:1000 6596 20.000 pRK7 -proxns-2 833 

. 1-10 495 CTS pRK7.proins-2 

F x',100 UU <*S P RX7.proins.2 ~- 

., 1=500 5011 25,000 pRK7.proxns.2 1041 

, • i-1000 5552 35.000 pBX7.pr=ins.2 1458 

G 1=100 5548 3500 pKX7-.pr 01 ns.3 146 

G . .1=500 7023 5000 P BK7 .proxns.3 20 

i-1000 7055 8000 p*K7.proxns.3 333 

H 1 = 10 2021 -T- p.ua.proins.3 

H 1=100 LOST - pRK7.pro.ns. 

B 1=500 6828 • 7500 pRK7.prox»s.3 312 

H 1-1000 7255 7000 pBK7 .proxns . 3 291 
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I no dil LOST — concrol 

Z 1:5 LOST — control 

j no dil LOST — - concrol 

j 1:5 7600 LTS* control 

*pRK7. proins. 1 is proins . RTKR . Ip/RQKR . Hp 
*pRK7. proins. 2 is proins. BIO H>D. 
*pRK7. proins. 3 is proins . RTKR . Ip/KQKR . Hp . B10H>D. 
*GTS is greater than standard 
5 *LTS is less than standard 



RIA 


Results 


in 293 








smple dil. 


CPK 


cone (uU) 


construct 


ng/ml 


1 


1:10 


1200 


3000 


PRK7 . proins 


125 


1 


1:50 


2750 


2750 


PRK7 . proins 


115 


2 


1:10 


lost 





PRK7 . proins 




2 


1:50 


lost 





PRK7 . proins 


* — — — 


3 


no dil 


1522 


145 


PRK7. proins. 1" 


6 


3 


1:3 


2668 


180 


PRK7 . proins . 1 


7 .5 


3 


1:6 


lost 





PRK7 . proins . 1 




4 


no dil 


1581 


135 


PRK7 .proins. 1 


5 


4 


1:3 


2S51 


156 


PRK7.proins.l 


6.5 


4 


1:6 


4046 


210? 


PRK7 . proins . 1 


8.7 


5 


1:10 


542 


GTS" 


PRK7 . proins . 2 * 


" " " " 


5 


1:50 


1327 


13.500 


PF.K7 .proins . 2 


560 


5 


1:100 


1582 


20,000 


PRK7 . proins . 2 


O O T 


6 


1:10 


667 


GTS 


PRK7 . proins . 2 




6 


1:50 


1448 


12,500 


PRK7 . proins . 2 


520 


6 


1:100 


1470 


22,000 


PRK7 . proins . 2 


916 


7 


1:10 


366 


GTS 


PRK7 . proins . 3 * 




7 


1:50 


876 


GTS 


PRK7 . proins . 3 




7 


1:100 


1176 


GTS 


PRK7 . proins . 3 


>1125 


8 


1:10 


467 


GTS 


PRK7 . proins . 3 




.8 


1:50 


931 


GTS 


PRK7 . proins . 3 




8 


1 :100 


1038 


GTS 


PRK7 . proins . 3 


>1125 


9 


no dil 


713 


GTS 


PRK7 . proins . 4 * 




9 


1:3 


927 


GTS 


?RK7 . proins . 4 




9 


1:6 


1475 


840 


PRK? . proins . 4 


35 


10 


no dil 


416 


GTS 


PRK7 . proins . 4 
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10 


1:3 


1079 


GTS 






1435 


840 










11 


no QiX 






11 


1:3 


2686 


195 


11 


1:6 


3615 


270 


12 


no dil 


lose 


— 


12 


1:3 


1486 


435 


12 


1:6 


3807 


270 


13 


no dil 


840 


GTS 


13 


1:3 


776 


GTS 


13 


1:6 


2468 


420 


14 


no dil 


740 


GTS 


14 


1:3 


1217 





14 


1:6 


2368 


420 


15 


1:5 


931 


GTS 


15 


1:10 


1672 


2100 


16, 


all 


5800 


LTS* 
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PRK7 .proins.4 
PRK7. proins. 4 35 



PRK7. proins. 5 10-11 

PRK7 .proins . 5 10-11 

PRK7. proins. 5 10-11 

PRK7 . proins . 5 10-11 

PRK7 .proins . 5 10-11 
PRK7 .proins .6* 

PRK7 . proins .6 

PRK7 . proins .6 17 . 5 
PRK7 .proins . 6 
PRK7 . proins .6 

PRK7. proins. 6 17.5 
PRK7 . proins . 7 * 

PRK7. proins. 7 8.8 

control 0 



* P RK7.proin S .l is proins.RTKR.Ip/RQKR.HP 

* P RK7. proins. 2 is proins. B10 H>D. 

* pIUC7.proins.3 is proins. RTKR.ip/RQKR. lip. B10 K>D. 

* pRK7 .proins. 4 is proins . RTKR . Ip 

* P RK7. proins. 5 is proins . RQKR . Hp 

5 * P RK7.proins.6 is proins.KR.Ip/RQKR.HP 

* pRK7. proins. 7 is proins . KTKR . Ip 

* GTS is greater than standard 

* LTS is less than standard 

10 Expression of proinsulin GXDreS sion vector 

Following transfection with an e*P resS1 °* 
carrying the wildtype proinsulin cDNA, 293 HEK cells 
Resize and secrete uncleaved p" J 
Secular weight of 6.5 ». ^ 
1& proinsulin is immunoprecipitated fro, both the cell ly 

Ld the media using an insulin specific antibody. The 

processed material migrates with the 6.5* , 

LLr. Mature insulin, when reduced, migrates similarly 
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to the A and B chains at 2.35 and 3.4 kD when compared to 
the two bands of the molecular weight protein standards. 
Expression of Proinsulin Mutants 

Proins.RTKR.Ip/RQKR.Ilp is processed into products 

s that co-migrate with the A and B chains of insulin as well 
as processing intermediates. when proins . RTKR. Ip or 
proins.RQKR.Ilp. representing a mutant cleavage site at the 
B-/C-chain junction or the A-/C-chain junction 
respectively, are transfectec, only one of the mature 

10 chains was detected along with intermediates. When the 
mutant cleavage sice is at the A-/C-chain junction, the A 
chain is detected along with a slower migrating band 
consistent with the B-C intermediate. When the mutant 
cleavage site is at the B-/C-chain junction, the B chain is 

15 detected along with a faster migrating band consistent with 
the A-C intermediate. intermediates disappear when the 
amount of 293 homologous-cell enzyme that recognizes the 
mutant cleavage sites is increased by cotransf ection with 
an expression vector expressing the 293 homologous-cell 

20 enzyme. Proins . KTKR . Ip was resistant to cleavage by 
the 293 cell enzyme. 

Detection of Bioactivity 

When mature, active insulin binds to the A-chain of 

25 the insulin receptor outside the cell, tyrosine residues on 
the B-chain of the insulin receptor are aucophosphorylated 
(Kasuga et al., science . 215:185-186 11982)) Therefore, 
insulin bioactivity was assayed by measuring the ability of 
the B-chain of the insulin receptor to become 

30 autophosphorylated. Phosphorylation of the B-chain can be 
visually observed on an immunoblot. Upon stimulation with 
increasing concentrations of bovine insulin, the B-chain is 
increasingly observed at a mass of 96 kD by probing with 
antiphosphotyrosine antibodies. A similar increase in B- 

35 chain phosphorylation is detected by stimulation with 
various concentrations of the insulin mutants, 
proins .RTKR. Ip/RQKR. IIP and proins . RTKR . Ip/RQKR. Hp. BIO 
H>D. 
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a Tnaulia Binding 
Mi.tive specific Activity and I—" 

5 each insulin mutant chared to ">^"\ Ip/ROK R.IIp.BlO 

sr^r-i. = 

studies indicated similar binding 

, wildtype insulin " M "^TkZ *'***-™-™ ™> 
proins.BTKR.Ip/ROKR.HP and proins.RWR. P 

insulin mutants. 

in tne amount o £ processed „,suU» «•*» „ „ 

depending on transection •««"»^ ^ due to 

„ increase in the accumulation o£ «t» ve e ^ 

p „ins.K TOR 1P/K Q K R ."P B» > - ^ auces wlth ch e 
the accumulation o « 9 proinS ulin background. 

B10 H>D mutant in * ^^J^,, does „ 0 t yield 
"•"/rrJ • h — ' transaction wi* 

' 5 ~ yieia -ctable insulin -a 

Tn e Proins.RTKB.Ip/ROKR.xxp Bl H 

pr oins.KTKK.IP/^."P * — capacity 
Lsured by B-^.*^^.^,« nation 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2355 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



TCTAGATCTA 



CCTGGTGTGT CTCTGATCTT GCTTCTTTTC TCCCAGCCCT 50 

90 



TCCTACTTGT CTGAGAACAA GGTTTTGAGC CATGGAGCAA AGAGGTTGGA 100 

CTCTCCAGTG TACTGCTTTC GCCTTCTTTT CCCTTTGGTC TCCACTAACC 150 

ACTCTAAAAG CAAAGAGGCA GTTTGTTAAT GAATCGGCGG CGGAGATCCC 200 

CGGAGGGCAA GAAGCTGCCT CTGCCATCGC CGAAGAACTG GGGTATGACC 250 

TTTTGGGTCA GATTGGATCA CTTGAAAATC ACTATTTATT CAAACACAAA 300 

AGCCATCCTC GGAGGTCCCG AAGAAGCGCT CTTCATATCA CTAAGAGGTT 350 

ATCTGATGAT GATCGTGTGA CCTGGGCTGA ACAACAGTA? GAAAAAGAGA 400 

GAAGTAAACG TTCAGTTCAA AAAGACTCAG CATTGGATCT CTTCAATGA? 450 

CCAATGTGGA ATCAGCAGTG GTACTTGCAA GATACCAGAA TGACTGCAGC 500 

TCTGCCCAAG CTGGACCTTC ATGTAATACC TGTTTGGGAA AACGGTATTA 550 

CTCGCAAAGG AGTTGTTATT ACTGTACTGG ATGATGGCTT GCAGTGGAAT 600 

CACACAGACA TTTATGCCAA TTATGATCCA GAGGCTAGCT ATGATTTTAA 650 

CGATAATGAT CATGATCCAT TTCCCCGATA TCATCTCACA AATGAAAACA 700 

AACATGGAAC AAGATGTGCA CGTGAAATTG CCATGCAAGC AAATAATCAC 750 

AAGTGTGGGG TTGGAGTTGC ATATAATTCC AAAGTTGGAG GCATAAGAAT 800 

GCTGGATCGC ATTGTAACTG ATGCCATTGA GGCTAGTTCA ATTGGATTCA 850 

ACCCTGGCCA TGTGGATATT TACAGTGCAA GCTGGGGCCC TAATGATGAT 900 

GGAAAAACTG TGGAGGGGCC TGGCAGACTA GCCCAGAAGG CATTTGAATA 950 

TCGTCTCAAA CAGGGGAGAC AAGGGAAAGG CTCCATCTTT GTCTGGGCTT 1000 

CAGGGAATGG GGGTCGTCAG GGAGATAACT CTGACTGTGA TGGCTACACA 1050 

GACAGCATTT ACACCATCTC TATCAGCAGT GCCTCCCAGC AAGGCCTGTC 1100 
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ACCTTGGTAT GCAGAGAAGT GTTCTTCCAC ATTGGCTACC TCCTACAGCA 1150 

mum 



CTGGTGATTA CACAGACCAG CGAATAACAA GCGCTGACt. l buium* 
TGCACAGAGA CCCACACAGG CACCTCGGCT TCAGCACCCC TGGCTGCTGG 1250 

10 

TATCTTIGCT CTGGCCTTGG AGGCAAACCC AAATCTTACC TGGAGAGATA 1300 
15 TGCAGCATCT GGTTGTCTGG ACCTCTGAGT ACGACCCATT GGCCAGTAAC 1350 

CCAGGTTGGA AAAAGAATCC GGCACGCTTG ATGCTGAACA GCCGATTTGG 1400 
20 ATTTGGCTTG CTAAATGCCA AAGCTCTGGT GGATTTGGCT GATCCTCGGA 1450 

CCTGGAGAAA TGTGCCTGAG AAGAAAGAAT GTCTTGTAAA AGACAATAAC 1500 

25 

TTTGAGCCTA GAGCCCTGAA ACCTAATGGA CAAGTAATTG TTGAAATCCC 1550 
30 AACAAGAGCT TGTCAACGAC AAGAAAATGC TATCAAGTCT CTGGAACATG 1600 

TGCAATTTGA ACCAACAATT GAATATTCTC GTAGAGGAGA CCTTCATGTC 1650 
" ACACTCACTT CTGCTGTTGG AACCAGCACT GTACTGTTGG CTGAAAGGGA 1700 

AAGAGATACA TCCCCCAATC GCTTTAAGAA TTGGGACTTC ATGTCTGTTC 1750 

40 

ATACATGGGG AGAGAATCCT GTAGGCACCT GGACATTGAA AATTACAGAC 1800 
45 ATGTCTGGAA GAATCCAAAA TCAAGGAAGG ATTGTGAACT CGAAGTTCAT 1850 

TTTGCATGGG ACATCTTCTC AACCAGAGCA CATGAAGCAG CCCCGTGTGT 1900 
50 ACACATCCTA CAATACAGTC CAGAATGACA GCAGAGGAGT GGAAAAGATG 1950 

GTGAATGTTG TCGAGAAGCG GCCCACACAA AAGAGCCTGA ATGGCAATCT 2000 

55 

CCTCGTACCC AAAAACTCCA GCAGCAGCAA TGTGGAGCGT AGAAGCGATG 2050 
60 AGCAGGTACA AGGAACTCCT TCAAAGGCCA TGCTGCGACT CCTACAAAGT 2100 
GCTTTTAGCA AGAATGCACT TTCAAAACAA TCACCAAAGA AGTCTCCAAG 2150 
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TGCAAAGCTC AGCATCCCTT ATGAAAGTTT CTATGAAGCC TTCCAAAAGC 2200 
TTAACAAGCC CTCCAAGCTT CAAGCCTCTG AAGACACZCT GTACACTCAC 2250 
TATCTTGATG TATTCTATAA CACAAAACCT TATAAGCATA GAGATGACAG 2300 
GCTGCTGCAA GCTCTCATGG ACATCCTAAA TGAGGAGAAT TAAAATAAGG 2350 
15 AGCTC 2355 



10 



20 



35 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2012 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
30 TCTAGATCCA TCTTCCCTCT TCGTCCCCTG CTCCACCACC CTCCCCGCCT 50 
CACAGCCCCGCTTTTCACTC CCAAAGAAGG ATCCAGCGCG CTTCTGGATC 100 
CCAGTGGAAG GCGGCCGGGT TCCTCTTCTG TGTCATCCTT TTTGCGTCTG 150 
CCGAGAGACC CGTCTTCACG AATCATTTTC TTGTGGAGTT CCATAAAGAC 200 
GGAGAGGAAG AGGCTCGCCA AGTTGCAGCA GAACACGGCT TTGGAGTCCG 250 
45 AAAGCTCCCC TTTGCAGAAG GCCTGTATCA CTTTTATCAC AATGGGCTTG 300 
CAAAGGCCAA AAGAAGACGC AGCCTACACC ATAAGCGGCA GCTAGAGAGA 350 
GACCCC AGGA TAAAG ATGGC GCTGCAACAA GAAGGATTTG ACCGTAAAAA 400 
CAGAGGGTAC AGGGACATCA ATGAGATTGA CATCAACATG AATGATCCTC 450 
TCTTTACAAA GCAATGGTAC CTGTTCAACA CTGGGCAAGC CGATGGAACT 500 
60 CCTCGGCTAG ACTTGAACGT GGCCGAAGCC TGGGAGCTGG GATACACAGG 550 
AAAAGGAGTC ACCATTGGAA TTATGGATGA TGGAATTGAC TATCTCCACC 600 
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CAOACCTCCC CTACAACTAC AACGCTCATC CAAGTTATCA CTTCAGCACC 650 
AATGACCCCT ACCCATACCC TCCATACACA GATGACTCCT TCAACAGCCA 700 
TO3AACTAGG TCTCCAGGAC AAGTTTCTGC TCCACCCAGC AACAATATCT 750 
GTGGAGTCGG CGTAGCATAC AACTCCAACC TGGCAGGCAT CCGGATOCTG 800 
15 GACCAGCCCT TTATGACAGA CATCATCGAA GCCTCCTCCA TCAGCCACAT 850 
GCCTCAACTC ATCGACATCT ACAGTGCAAG CTGGGCCCCC ACAGACAATG 900 
20 GGAAGACGGT TGATGGGCCC CGAGAGCTCA CACTCCAGGC CATGCCTGAT 950 

I 

GGCGTGAACA AGGGCCCTCG GGGCAAACGC AGCATCTATG TGTGGGCCTC 1000 

25 

TCGGGACGGT GGCAGCTACG ATGACTGCAA CTGTGACGGC TATGCTTCAA 1050 
30 GCATGTGGAC CATCTCCATC AACTCAGCCA TCAATGATGG CAGGACTGCC 1100 

TTGTATGATG AGAGTTGCTC TTCCACCTTA GCATCCACCT TCAGCAATGG 1150 
35 GAGGAAGAGG AATCCTGAGG CTGGTGTGGC TACCACAGAC TTGTATGGCA 1200 

ACTGTACTCT GAGACACTCT GGGACATCTG CAGCTGCTCC GGAGGCAGCT 1250 

40 

GGCGTCTTTG CATTAGCTTT GGAGCCTAAC CTGGATCTGA CCTCGAGAGA 1300 
45 CATGCAACAT CTGACTGTGC TCACCTCCAA GCGGAACCAG CTTCATGATG 1350 

AGGTTCATCA GTGGCGACGG AATCGGGTTG GCCTGGAATT TAATCACCTC 1400 
50 TTTGGCTACG GAGTCCTTGA TCCACGTCCC ATGGTGAAAA TGGCTAAAGA 1450 

CTCGAAAACT GTTCCGGAGA GATTCCATTG TGTGGGAGGC TCTGTGCAGA 1500 

ACCCTGAAAA AATACCACCC ACCGGCAAGC TGGTACTGAC CCTCAAAACA 1550 
$0 AATGCATGTG AGGGGAAGGA AAACTTCCTC CGCTACCTGG AGCACGTCCA 1600 

AGCTGTCATC ACAGTCAACG CGACCAGGAG AGGAGACCTG AACATCAACA 16S0 
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5 



10 



20 



TGACCTCCCC AATGGCCACC AAGTCCATTT TCCTGAGCCG GCGTCCCACA 1700 
GACGACGACT CCAAGGTGCG CTTTGACAAG TCCCCTTTCA TGACCACCCA 1750 
CACCTGGGGG GAGGATGCCC GAGGGACCTG GACCCTGGAG CTGGGGTTTG 1800 
TGGGCAGTGC ACCACAGAAG CGGTTCCTGA AGGAATGGAC CCTGATGCTT 1850 
15 CACGQCACAC AGAGCGCCCC ATACATCGAT CAGCTCCTCA GGGATTACCA 1900 
GTCTAAGCTG GCCATCTCCA AGAAGCAGGA GCTGGAGGAA GAGCTGGATG 1950 
AGGCTGTGGA GAGAAGCCTG CAAAGTATCC TGAGAAAGAA CTAGGGCCAC 2000 
GCTTCCGAAT TC 2012 

25 

(2) INFORMATION FOR SEQ ID NO:3: 

30 li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID MO: 3: 

Met Glu Gin Arg Gly Trp Thr Leu Gin Cys Thr Ala Phe Ala Phe 
1 5 IC 15 

40 Phe Cys Val Trp Cys Ala Leu Ser Ser Vol Lys Ala Lys Arg Gin 

20 25 JU 

Phe Val Asn Glu Trp Ala Ala Glu lie Pro Cly Gly Gin Glu Ala 
35 4 ^ 

Ala Ser Ala lie Ala Glu Glu Leu Gly Tyr Asp Leu Leu Gly Gin 
50 55 60 

■lie Gly Ser Leu Glu Asn His Tyr Leu Phe Lys His Lys Ser His 
65 70 75 

Pro Arg Arg Ser Arg Arg Ser Ala Leu His lie Thr Lys Arg Leu 
80 85 v vv 

55 Ser Asp Asp Asp Arg Val Thr Trp Ala Glu Gin Glr. Tyr Glu Lys 

§5 100 AW3 

Clu Arg Ser Lys Arg Ser Val Cln Lys Asp Ser Ala Leu Asp Leu 
110 115 

Phe Asn Asp Pro Met Trp Asn Gin Gin Trp Tyr Leu Cln Asp Thr 
125 130 



45 



50 



60 
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Ara Met Thr Ala Ala Leu Pro Lys Leu ASp Leu His Val He Pro 
* 140 145 



^1^^liT■T■-^n t ^1V^'WV^^^*^- V,l - fflMS ^^ 




IS 



20 



30 



35 



45 



50 



€0 



Leu 



Asp Asp Gly Leu Glu Tr P Asn His Thr Asp lie Tyr Ala Asn 



170 



10 Tyr Asp Pro Glu Ala Ser Tyr Asp Phe Asn Asp Asn Asp His Asp 

185 

Pro Phe Pro Arg Tyr Asp Leu Thr Asn Glu Asn Lys His Gly Thr 

200 205 

Arg Cys Ala Gly Glu lie Ala Met Gin Ala Asn Asn His Lys Cys 



215 



Cly Val Gly Val Ala *yr Asn Ser Lye Val Gly Gly He Arg Met 
230 23b 

Leu Asp Gly lie Val Thr Asp Ala lie Glu Ala Ser Ser He Gly 
245 250 

25 Phe Asn Pro Gly His Val Asp He Tyr Ser Ala Ser Trp Gly Pro 



260 



Asn 



Lys 



ser 



Asn 



Asp Asp Gly Lys Thr Val Glu Cly Pro Gly Arg Leu Ala Gin 

230 



275 



Ala Phe Glu Tyr Gly Val Lys Gin Gly Arg Gin Gly Lys Gly 



290 



295 



40 He Ser Ser 



lie Phe Val Trp Ala Ser Gly Asn Gly Gly Arg Gin Gly Asp 
305 310 

Cys Asp Cys Asp Gly Tyr Thr Asp Ser lie Tyr Thr lie Ser 

320 32b 
Ala Ser Gin Gin Gly Leu Ser Pro Trp Tyr Ala Glu 



335 



340 



Lys Cys Ser Ser Thr Leu Ala Thr Ser Oyr Ser Ser Gly Asp Tyr 
350 35b 



Thr Asp Gin 



Arg He Thr Ser Ala Asp Lau His Asn Asp Cys Thr 



365 



370 



Glu Thr His Thr Gly Thr Ser Ala Ser Ala Pro Leu Ala Ala Gly 



He Phe 



330 385 

Ala Leu Ala Leu Glu Ala Asn Pro Asn Leu Thr Trp Arg 
395 400 * u:> 

55 Asp Met Gin His Leu Val Val Tr P Thr Ser Glu Tyr Asp Pro Leu 

410 

Ala Ser Asn Pro Gly Trp Lys Lys Asn Gly Ala Gly Leu Met Val 



425 



430 



Asn Ser Arg 



Phe Gly Phe Cly Leu Leu Asn Ala Lys Ala Leu Val 



440 



445 
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Asp Leu Ala Aso Pro Arg Thr Trp Arcj Asr ; V*l Pre Glu Lys Lys 

455 460 465 

Glu Cys Val Val Lys Asp Asn Asn Phe Glu Pro Arg Ala Leu Lys 

5 4 70 47S 480 

Ala Asn Gly Glu Val lie Val Glu He Pro Thr Arg Ala Cys Glu 

485 49C 495 

10 Gly Gin Glu Asn Ala He Lys Ser Leu Glu His Val Gin Phe Glu 

500 505 510 



15 



30 



35 



40 



45 



60 



Ala Thr He Glu Tyr Ser Arg Arg Gly Asp Leu His Val Thr Leu 

515 520 525 

Thr Ser Ala Val Gly Thr Ser Thr Val Leu Leu Ala Glu Arg Glu 

530 535 540 



Arg Asp Thr Ser Pro Asn Gly Phe Lys Asn Trp Asp Phe Met Ser 

20 545 550 555 

Val His Thr Trp Gly Glu Asn Pro Val Gly Thr Trp Thr Leu Lys 

560 565 570 

25 He Thr Asp Met Ser Gly Arg Met Gin Asn Glu Gly Arg He Val 

575 58C 585 



Asn Trp Lys Leu He Leu His Gly Thr Ser Ser Gin Pro Glu His 

590 595 600 

Met Lys Gin Pro Arg Val Tyr Thr Ser Tyr Asn Thr Val Gin Asn 

60S €10 615 

Asp Arg Arg Gly Val Glu Lys Mec Val Asr. Val Val Glu Lys Arg 

620 625 630 

Pro Thr Gin Lys Ser Leu Asn Gly Asn Leu Leu Val Pro Lys Asn 

635 640 645 

Ser Ser Ser Ser Asn Val Glu Gly Arg Arg Asp Glu Gin Val Gin 

650 655 660 

Gly Thr Pro Ser Lys Ala Met Leu Arc Leu Leu Gin Ser Ala Phe 

665 670 675 

Ser Lys Asn Ala Leu Ser Lys Gin Ser Pro Lys Lys Ser Pro Ser 

680 685 690 



Ala Lys Leu Ser He Pro Tyr Glu Ser Phe Tyr Glu Ala Leu Glu 
50 695 70G 705 

* * Lys Leu Asn Lys Pro Ser Lys Leu Glu Gly Ser Glu Asp Ser Leu 
710 % 720 

55 Tyr Ser Asp Tyr Val Asp Val Phe Tyr Asn Thr Lys Pro Tyr Lys 

725 7 30 735 



* His Arg Asp Asp Arg Leu Leu Gin Ala Leu Met Asp He Leu Asn 
740 745 7&0 

Glu Glu Asn 
753 
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(2) INFORMATION FOR SEQ ID NO: 4: 



(A) LENGTH: 637 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



10 



15 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Het Glu Gly Gly Cys Gly Ser Gin Trp Lys Ala Ala Gly Phe Leu 
1 5 10 15 

Phe Cys Val Met Val Phe Ala Ser Ala Glu Arg Pro Val Phe Thr 
20 25 30 

Asn His Phe Leu Val Glu Leu His Lys Asp Gly Glu Glu Glu Ala 
35 40 45 

Arg Gin Val Ala Ala Glu His Gly Phe Gly Val Arg Lys Leu Pro 
50 55 60 

Phe Ala Glu Gly Leu Tyr His Phe Tyr His Asn Gly Leu Ala Lys 
65 70 75 

Ala Lys Arg Arg Arg Ser Leu His His Lys Arg Gin Leu Glu Arg 
30 85 90 



30 



35 



Asp Pro Arg He Lys Met Ala Leu Gin Gin Glu Gly Phe Asp Arg 

95 100 105 

Lys Lys Arg Gly Tyr Arg Asp He Asn Glu He Asp He Asn Met 

110 115 120 

Asn Asp Pro Leu Phe Thr Lys Gin Trp Tyr Leu Phe Asn Thr Gly 

125 130 135 



Gin Ala Asp Gly Thr Pro Gly Leu Asp Leu Asn Val Ala Glu Ala 

140 145 150 

40. Trp Glu Leu Gly Tyr Thr Gly Lys Gly Val Thr He Gly He Met 

155 160 165 



45 



Asp Asp Gly lie Asp Tyr Leu His Pro Asp Leu Ala Tyr Asn Tyr 

170 175 180 

Asn Ala Asp Ala Ser Tyr Asp Phe Ser Ser Asn Asp Pro Tyr Pro 

185 190 195 



2yr Pro Arg Tyr Thr Asp Asp Trp Phe Asn Ser His Gly Thr Arg 
50 200 205 210 

Cys Ala Gly Glu Val Ser Ala Ala Ala Ser Asn Asn He Cys Gly 
215 220 \ 225 

55 Val Gly Val Ala Tyr Asn Ser Lys Val Ala Glv He Arg Met Leu 

230 235 240 



60 



Asp Gin Pro Phe Met Thr Asp He He Glu Ala Ser Ser lie Ser 
245 250 255 

His Met Pro Gin Leu lie Asp lie Tyr Ser Ala Ser Trp Gly Pro 
260 265 270 
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Thr Asp Asn Gly Lys Thr Val Asp Gly Pre Arg Ciu Leu Thr Leu 
275 280 28b 

Gin Ala Met Ala Asp Gly Val Asn Lys Gly Arg Gly Gly Lys Gly 
5 290 295 300 

Ser He Tyr Val Trp Ala Ser Gly Asp Gly Gly Ser Tyr Asp Asp 
305 310 315 



10 



15 



30 



45 



60 



Cys Asn Cys Asp Gly Tyr Ala Ser Ser Met Trp Thr lie Ser He 
320 325 330 

Asn Ser Ala He Asn Asp Gly Arg Thr Ala Leu Tyr Asp Glu Ser 
335 340 345 

Cys Ser Ser Thr Leu Ala Ser Thr Phe Ser Asn Gly Arg Lys Arg 
350 355 360 



* Asn Pro Glu Ala Gly Val Ala Thr Thr Asp Leu Tyr Gly Asn Cys 
20 365 370 375 

Thr Leu Arg His Ser Gly Thr Ser Ala Ala Ala Pro Giu Ala Ala 
380 38$ 390 

25 Gly Val Phe Ala Leu Ala Leu Glu Ai& Asn Leu Asp Leu Thr Trp 

)gc iCO 40t> 



Arg Asp Met Gin His Leu Thr Val Leu Thr Ser Lys Arg Asn Gin 
410 415 420 

Leu His Asp Glu Val His Gin Trp Arg Arg Asn Gly Val Gly Leu 
425 430 435 



. Glu Phe Asn His Leu Phe Gly Tyr Gly Val Leu Asp Ala Gly Ala 
35 440 445 450 

Met Val Lys Met Ala Lys Asp Trp Lys Thr Val Pro Glu Arg Phe 
455 460 465 

40 His Cys Val Gly Gly Ser Val Gin Asn Pre Glu Lys lie Pro Pro 

470 475 480 



Thr Gly Lys Leu Val Leu Thr Leu Lys Thi Asn Ala Cys Glu Gly 
485 490 495 

Lys .Glu Asn Phe Val Arg Tyr Leu Glu His Val Gin Ala Val lie 
500 505 

Thr Val Asn Ala Thr Arg Arg Gly Asp Leu Asn lie Asn Met Thr 
50 515 520 525 

Ser Pro Met Gly Thr Lys Ser lie Leu Leu Ser Arg Arg Pro Arg 
530 535 - 540 

55 Asp Asp Asp Ser Lys Val Gly Phe Asp Lys Trp Pro Phe Met Thr 

545 550 



Thr His Thr Trp Gly Glu Asp Ala Arg Gly Thr Trp Thr Leu Glu 
560 565 5/0 

Leu Gly Phe Val Gly Ser Ala Pro Gin Lys Gly Leu Leu Lys Glu 
575 580 585 
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Trp rhr Leu Met Leu His Gly Thr Gin Ser Ala Pro Tyr He Asp 




Gin Glu Leu Glu Glu Glu Leu Asp Glu Ala Val Glu Arg Ser Leu 
620 €25 630 

10 Gin Ser He Leu Arg Lys Asn 

635 637 



15 



20 



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TOPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



GCAAAATCTA GAYKGCNATY GTNCAYGAKG GN 32 

25 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

* AAGCATGAGC TCNGGRGCRG CRGCNGANCC 30 ■ 
40 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 bases 
{B) TYPE: nucleic acid 

(C) -STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GATATCACTC AGATCGATGA ATTCGAGCTC 30 

55 



30 



35 



45 



50 ' 



(2) INFORMATION FOR SEQ ID NO: 8: 

€0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid* 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 8: 



5 



20 



35 



50 



AAGCTTTCTA GAGGATCCCT CTCCTGGATT TGG S3 



10 (2) INFORMATION FOR SEQ ID NO:S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
AAGCTTGAAT TCTCCAACCC CACACTTGTC 30 



25 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
AGATCGATGA ATTCGAGCTC 20 



40 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
CATTCTCGAA AAAAGAGACA A 21 



55 (2) INFORMATION FOR SEQ ID NO: 12: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 
60 . (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO;l 



101 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 bases ^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



CATTCTAGAG CAAAGAGACA A 21 

20 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 21 bases 

<B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

30 . (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 



CATTCTAGAA AAGCAAGACA A 21 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



CATTCTAGAA AAAGAGCACA A 21 



50 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

60 (Xi) SEQUENCE DESCRIPTION: SEQ ID N0:16 



ACCTGGAGCA AAGCTTCTCT G 21 
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(2) INFORMATION FOR SEQ ID NO: 17: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21- bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
15 ACCTOGAGCG CTAGGTCTCT G 21 



20 
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50 
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€0 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
30 CATAAGCTTA CCATGGCCCT GTCGATCCCC 3 0 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19; 
45 CATTCTAGAC TAGTTGCAGT AGTTCTCC AG 3 0 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Lys Thr Arg Arg 
1 < 

(2) INFORMATION FOR SEQ ID NO:21: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 



(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 



15 



20 



25 



30 



35 



40 



45 
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Leu Gin Lys Arg 
1 4 



10 (2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Arg Thr Lys Arg 
1 4 

(2 J INFORMATION FOR SEQ ID NO: 23: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Arg Gin Lys Arg 
1 4 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Lys Thr Lys Arg 
1 4 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
CTCTCCCTCC CGCTTGGTCC TGGGTGTGTA G 31 



60 



(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 bases 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



CACGCTTCTG CCGGGATCCC TC 22 

10 



(2) INFORMATION FOR SEQ ID NO: 2*7: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2?: 



CTCTGCCTCC CGCTTGGTCT TCGGTGTGTA G 31 

25 



(2) INFORMATION FOR SEQ ID NO: 28: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



GATATGAAGA GCAGATCTTT TGGACCTCCG AGGATG 36 

40 



(2) INFORMATION FOR SEQ ID NO: 29: 

45 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 39 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
. ... . . ; . (D) TOPOLOGY: linear 

.50 

(Xi). SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CTTATGGTGT AAGCTTCGTT TTGCTCTGGC CTTTGCAAG 39 

55 

(2) INFORMATION FOR SEQ ID NO:30: 

60 (i) .SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: nucleic acid' 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 




TACAACTCAC CGCGGGTCCT G 21 



10 (2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 bases 
(BJ TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



AAGATGGGAT GGGATGATGA CCGTTTCCGC CTTGATGT 38 



25 (2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



ACATCACGGC GGAAACGGTC ATCATCCCAT CCCATCTT 38 



40 (2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



GATATAAGCT TGAGAGTGTA GAAGGGGC 28 



55 (2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
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GGCTTGACAT CATTGGCTGA CACTTTCGAA CACATGATAG AO 

5 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 41 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



20 



35 



AAGATGGGAT GGGATGATGA GCGCCGGACC CTCATGGACA T 41 



(2) INFORMATION FOR SEQ ID NO: 35; 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 41 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

ATGTCCATGA GGGTCCGGCG CTCATCATCC CATCCCATCT T 41 



(2) INFORMATION FOR SEQ ID NO:37: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 28 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



CATATAAGCT TGAGAGTGTA GAAGGGGC 28 

50 

(2 ). INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 51 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 



TGAGCGACAG CACCCCCTTG GAGCCCCCGC CCTTGTATCT CATGGAGGAT 50 

107 



WO 93/ 11 -M/ 



T 51 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 51 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

TCAGCTCCCC TCGTCGGCCG GGGTCCGAGT GCCGTTTCCG CCGTGATGTT 50 

20 

C 51 



25 (2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 bases 

(B) TYPE: nucleic acid 
3Q (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
" ACGTGGGCAG CCCCGTCGTC GCGAACAGAA CATCACGGCG GAAACGGC 



40 (2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 bases 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
50 TCTTCGCCAC CACGGGGCTG CCCACGTAAT CCTCCATCAG ATACAAGG 



55 (2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: nucleic acid 
6 q (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
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CTCGGACCCC GCCCGACGAG CGGAGC 26 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NC:43; 



GCGGGGGCTC CAAGGGGGTG CTGTCGC 27 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 54 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 



GTAGAACAAC ATGGACATGG TGGCAATATT CTCGACTCTG GAGTCGACCT 50 

35 

GCAG 54 



40 (2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 bases 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
50 CACATAAAAC AAGATCGACA TGGTCTTGTT CACCTGTAGG ATCCCCGG 48 



55 (2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 bases 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: single 

(D) TOPOLOGY i linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
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AGTAAGGAAA AGGATCGTCA TGGTGGAGCT CGACAAGCTT GAGAATTCAA 
TCG 53 



10 (2) INFORMATION FOR SEQ ID NO: 47: t 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Xaa Xaa Xaa Arg 
20 1 4 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOr48: 

30 

His Ser Arg Lys Lys Arg Gin 
15 7 

(2) INFORMATION FOR SEQ ID NO: 49: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



His Ser Val Lys Lys Arg Gin 
1 5 7 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

* (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

55 His Ser Arg Ala Lys Arg Gin 
1 5 7 

(2) INFORMATION FOR SEQ ID NO: 51: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

His Ser Arg Lys Ala Arg Gin 
5.1 57 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

His Ser Arg Lys Arg Ala Gin 
1 5 7 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(6) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Thr Trp Ser Lys Ala Ser Gin 
1 5 7 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

40 Thr Trp Ser Ala Arg Ser Gin 
1 5 7 
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We claim: 

1 a method for the production of a heterologous 

cell comprising: 

a) introducing into the polypeptide factor-dependent 
host cell nucleic acid encoding a heterologous polypeptide 
factor precursor comprising a cleavage site heterologous to 
the polypeptide factor and recognizable by a host cell 
enzyme and wherein said host cell is dependent on the 
cleavage product of said polypeptide factor precursor; and 

b) culturing said host cell under conditions wherein 
the polypeptide factor precursor is expressed and cleaved 
at said cleavage site by the host cell enzyme, thereby 
producing said polypeptide factor 

2. The method of claim 1. wherein said cleavage site 
recognizable by said host cell is a prohormone hormone 
convertase cleavage site. 

20 3 The method of claim 2 wherein said prohormone 
convertase cleavage site is ZXZR (Seq ID #47), wherein Z is 
LYS or ARG; X is any amino acid; and R is ARC 

4. The method of claim 1 wherein said host cell is 
25 mammalian. 

5. The method of claim 4 wherein said host cell is a 
Chinese hamster ovary cell. 

30 6. The method of claim 1 further comprising introducing 
into said host, cell nucleic acid encoding a selectable 
gene. 

7. The method of claim 1 wherein said polypeptide factor 
35 precursor is proinsulin and said polypeptide factor is 

insulin. 

8. The method of claim 7 wherein said nucleic acid 
encoding proinsulin is operably linked to a promoter. . 
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9. The method of claim 8 wherein said promoter is 
inducible . 

10. The method of claim 1 wherein said nucleic acid 
5 encoding a polypeptide factor precursor is 

proins . RTKR . Ip/RQKR .Hp. 

11. The method of claim 1 wherein said nucleic acid 
encoding a polypeptide factor precursor is 

10 proins. RTKR. Ip/RQKR. Up. BIO H>D. 

12. The method of claim 1 comprising the additional steps: 
a) further introducing into the host cell nucleic 

acid encoding a desired polypeptide; and 
15 b) culturing said host cell under conditions wherein 

said desired polypeptide is expressed. 

13. The method of claim 12 wherein said desired 
polypeptide is selected from the group comprising: relaxin, 

20 insulin-like growth factor I and II, growth hormone; factor 
VIII; factor IX; tumor necrosis factor-alpha and -beta; 
tissue factor protein; inhibin; activin; vascular 
endothelial growth factor; thrombopoietin; nerve growth 
factor; platelet-derived growth factor; fibroblast growth 

25 factor; epidermal growth factor; transforming growth 
factor; insulin-like growth factor-I and -II; interferon; 
GM-CSF; G-CSF; interleukin; decay accelerating factor; and 
atrial natriuretic peptides A, E or C. 

30 14. The method of claim 12 further comprising recovering 
said desired polypeptide 

15. The method of claim 12 wherein said desired 
polypeptide further comprises a precursor region having a 
35 second cleavage site recognizable by a host cell enzyme and 
wherein the precursor is cleaved at said second cleavage 
site t>y the host cell enzyme, thereby producing said 
desired polypeptide. 
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16 The method of claim 1 comprising the additional steps: 

a) further introducing into the host cell nucleic 

b) culturing said host cell under conditions wherein 
5 said host cell enzyme is expressed. 

17. A method for producing a desired polypeptide in a 

host cell, comprising: 

a) introducing into the host cell nucleic acid 
10 encoding the desired polypeptide precursor having a 

cleavage site not recognizable by any endogenous host cell 
• enzyme and recognizable by enzyme heterologous to said host 

cell; , • ' „<h 

b) introducing into said host cell nucleic acid 

XS encoding the enzyme heterologous to said host cell and 
wherein said enzyme heterologous to said host cell further 
comprises a precursor having a cleavage site recognizable 
by a host cell enzyme, and 

c) culturing said host cell under conditions wherein 
20 said precursor of enzyme heterologous to said host cell is 

expressed and cleaved at said cleavage site by host cell 
enzyme thereby producing said heterologous enzyme and 
wherein said desired polypeptide precursor is expressed and 
cleaved at said cleavage site -by said enzyme heterologous 
25 to said host cell thereby producing said desired 
polypeptide. 

18. The method of claim 17 further comprising recovering 
said desired polypeptide. 

19 The method of claim 17 wherein the desired polypeptide 
is'relaxin, growth hormone; factor VIII; factor IX; tumor 
necrosis factor-alpha and -beta; tissue factor protein; 
inhibin; activin; vascular endothelial growth factor; 
35 thrombopoietin; nerve growth factor; platelet -derived 
growth factor; fibroblast growth factor; epidermal growth 
factor; transforming growth factor; insulin-like growth 
factor-I and -II; interferon; GM-CSF; G-CSF; interleukin; 
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decay accelerating factor; and atrial natriuretic peptides 
A, b or c. 

20. The method of claim 17 wherein the enzyme heterologous 
5 to said host cell is a prohormone convertase. 

21. The method of claim 20 wherein said prohormone 
convertase is mammalian prohormone convertase 1 or 
mammalian prohormone convertase 2 . 

10 

22. The method of claim 17 wherein said host cell is a 
polypeptide- factor dependent host cell. 

23. The method of claim 22 furcher comprising introducing 
15 into the polypeptide factor-dependent host cell nucleic 

acid encoding a heterologous polypeptide factor precursor 
further comprising a cleavage site recognizable by a host 
cell enzyme or a heterologous enzyme added to said host 
cell, and culturing said host cell under conditions wherein 
20 the polypeptide factor precursor is cleaved at the cleavage 
site thereby producing said polypeptide factor. 

24. The method of claim 23 wherein said polypeptide factor 
precursor is encoded by proins.RTKR.Ip/RQKR.llp.BlO H>D. 

25 

25. The method of claim 17 further comprising introducing 
into the host cell nucleic acid encoding a selectable gene. 

26. The method of claim 17 wherein said host cell is 
30 mammalian. 

27. The isolated proinsulin encoded by the nucleic acid, 
proins.RTKR.Ip/RQKR.llp.BlO H>D. 

35 28. The isolated nucleic acid proins.RTKR.Ip/RQKR.llp.BlO 
H>D. 
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TCTAGATCTA GCTGGTGTGT CTCTGATCTT GCTTCTTTTC TCCCAGCCCT 50 
TCCTACTTGT GTGAGAACAA GGTTTTGAGC CATGGAGCAA AGAGGTTGGA 100 
CTCTGCAGTG TACTGCTTTC GCCTTCTTTT GCGTTTGGTG TGCACTAAGC 150 
AGTGTAAAAG CAAAGAGGCA GTTTGTTAAT GAATGGGCGG CGGAGATCCC 200 
CGGAGGGCAA GAAGCTGCCT CTCCCATCGC CGAAGAACTG GGGTATGACC 250 
TTTTGGGTCA GATTGGATCA CTTGAAAATC ACTATTTATT CAAACACAAA 300 
AGCCATCCTC GGAGGTCCCG AAGAAGCGCT CTTCATATCA CTAAGAGGTT 350 
ATCTGATGAT GATCGTGTGA CGTGGGCTGA ACAACAGTAT GAAAAAGAGA 400 
GAAGTAAACG TTCAGTTCAA AAAGACTCAG CATTGGATCT CTTCAATGAT 450 
CCAATGTGGA ATCAGCAGTG GTACTTGCAA GATACCAGAA TGACTGCAGC 500 
TCTGCCCAAG CTGGACCTTC ATGTAATACC TGTTTGGGAA AAGGGTATTA 550 
CTGGCAAAGG AGTTGTTATT ACTGTACTGG ATGATGGCTT GGAGTGGAAT 600 
CACACAGACA TTTATGCCAA TTATGATCCA GAGGCTAGCT ATGATTTTAA 650 
CGATAATGAT CATGATCCAT TTCCCCGATA TGATCTCACA AATGAAAACA 700 
AACATGGAAC AAGATGTGCA GGTGAAATTG CCATGCAAGC AAATAATCAC 750 

AAGTGTGGGG TTGGAGTTGC ATATAATTCC AAAGTTGGAG GCATAAGAAT 800 
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FIG. 1B 

GCTGGATGGC ATTGTAACTG ATGCCATTGA GGCTAGTTCA ATTGGATTCA 850 
ACCCTGGCCA TGTGGATATT TACAGTGCAA GCTGGGGCCC TAATGATGAT 900 
GGAAAAACTG TGGAGGGGCC TGGCAGACTA GCCCAGAAGG CATTTGAATA 950 
TGGTGTCAAA CAGGGGAGAC AAGGGAAAGG CTCCATCTTT GTCTGGGCTT 1000 
CAGGGAATGG GGGTCGTCAG GGAGATAACT GTGACTGTGA TGGCTACACA 1050 
GACAGCATTT ACACCATCTC TATCAGCAGT GCCTCCCAGC AAGGCCTGTC 1100 
ACCTTGGTAT GCAGAGAAGT GTTCTTCCAC ATTGGCTACC TCCTACAGCA 1150 
GTGGTGATTA CACAGACCAG CGAATAACAA GCGCTGACCT GCACAATGAC 1200 
TGCACAGAGA CCCACACAGG CACCTCGGCT TCAGCACCCC TGGCTGCTGG 1250 
TATCTTTGCT CTGGCCTTGG AGGCAAACCC AAATCTTACC TGGAGAGATA 1300 
TCCAGCATCT GGTTGTCTGG ACCTCTGAGT ACGACCCATT GGCCAGTAAC 1350 
CCAGGTTGGA AAAAGAATGG GGCAGGCTTG ATGGTGAACA GCCGATTTGG 1400 
ATTTGGCTTG CTAAATGCCA AAGCTCTGGT GGATTTGGCT GATCCTCGGA 1450 
CCTGGAGAAA TGTGCCTGAG AAGAAAGAAT GTGTTGTAAA AGACAATAAC 1500 
TTTCAGCCTA GAGCCCTGAA AGCTAATGGA GAAGTAATTG TTGAAATCCC 1550 
AACAAGAGCT TGTGAAGGAC AAGAAAATGC TATCAAGTCT CTGGAACATG 1600 

TGCAATTTGA AGCAACAATT GAATATTCTC GTAGAGGAGA CCTTCATGTC 1650 

2/15 



FIG. 1C 

ACACTCACTT CTGCTGTTGG AACCAGCACT GTACTGTTGG CTGAAAGGGA 1700 
AAGAGATACA TCCCCCAATG GCTTTAAGAA TTGGGACTTC ATGTCTGTTC 1750 
ATACATGGGG AGAGAATCCT GTAGGCACCT GGACATTGAA AATTACAGAC 1800 
ATGTCTGGAA GAATGCAAAA TGAAGGAAGG ATTGTGAACT GGAAGTTGAT 1850 
TTTGCATGGG ACATCTTCTC AACCAGAGCA CATGAAGCAG CCCCGTGTGT 1900 
ACACATCCTA CAATACAGTC CAGAATGACA GGAGAGGAGT GGAAAAGATG 1950 
CCTGGTACCC AAAAACTCCA GCAGCAGCAA TGTGGAGGGT AGAAGGGATG 2050 
AGCAGGTACA AGGAACTCCT TCAAAGGCCA TGCTGCGACT CCTACAAAGT 2100 
GCTTTTAGCA AGAATGCACT TTCAAAACAA TCACCAAAGA AGTCTCCAAG 2150 
TGCAAAGCTC AGCATCCCTT ATGAAAGTTT CTATGAAGCC TTGGAAAAGC 2200 
TTAACAAGCC CTCCAAGCTT GAAGGCTCTG AAGACAGTCT GTACAGTGAC 2250 
TATGTTGATG TATTCTATAA CACAAAACCT TATAAGCATA GAGATGACAG 2300 
GCTGCTGCAA GCTCTCATGG ACATCCTAAA TGAGGAGAAT TAAAATAAGG 2350 
AGCTC 2355 



3/15 



WO 93/11247 PCT/US92/10621 

S^AGA^CA TCTTCCCTCT TCGTCCCCTG CTCCACCACC CTGCGCGCCT 50 
CACAGCCCCG CTTTTCACTC CCAAAGAAGG ATGGAGGGCG GTTGTGGATC 100 
CCAGTGGAAG GCGGCCGGGT TCCTCTTCTG TGTGATGGTT TTTGCGTCTG 150 
CCGAGAGACC CGTCTTCACG AATCATTTTC TTGTGGAGTT GCATAAAGAC 200 
GGAGAGGAAG AGGCTCGCCA AGTTGCAGCA GAACACGGCT TTGGAGTCCG 250 
AAAGCTCCCC TTTGCAGAAG GCCTGTATCA CTTTTATCAC AATGGGCTTG 300 
CAAAGGCCAA AAGAAGACGC AGCCTACACC ATAAGCGGCA GCTAGAGAGA 350 
GACCCCAGGA TAAAGATGGC GCTGCAACAA GAAGGATTTG ACCGTAAAAA 400 
GAGAGGGTAC AGGGACATCA ATGAGATTGA CATCAACATG AATGATCCTC 450 
TCTTTACAAA GCAATGGTAC CTGTTCAACA CTGGGCAAGC CGATGGAACT 500 
CCTGGGCTAG ACTTGAACGT GGCCGAAGCC TGGGAGCTGG GATACACAGG 550 
AAAAGGAGTG ACCATTGGAA TTATGGATGA TGGAATTGAC TATCTCCACC 600 
CAGACCTGGC CTACAACTAC AACGCTGATG CAAGTTATGA CTTCAGCAGC 650 
AATGACCCCT ACCCATACCC TCGATACACA GATGACTGGT TCAACAGCCA 700 
TGGAACTAGG TGTGCAGGAG AAGTTTCTGC TGCAGCCAGC AACAATATCT 750 
GTGGAGTCGG CGTAGCATAC AACTCCAAGG TGGCAGGGAT CCGGATGCTG 800 

GACCAGCCCT TTATGACAGA CATCATCGAA GCCTCCTCCA TCAGCCACAT 850 
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GGAAGACGGT TGATGGGCCC CGAGAGCTCA CACTCCAGGC CATGGCTGAT 950 
GGCGTGAACA AGGGCCGTGG GGGCAAAGGC AGCATCTATG TGTGGGCCTC 1000 
TGGGGACGGT GGCAGCTACG ATGACTGCAA CTGTGACGGC TATGCTTCAA 1050 
GCATGTGGAC CATCTCCATC AACTCAGCCA TCAATGATGG CAGGACTGCC 1100 
TTCTATGATG AGAGTTGCTC TTCCACCTTA GCATCCACCT TCAGCAATGG 1150 
GAGGAAGAGG AATCCTGAGG CTGGTGTGGC TACCACAGAC TTGTATGGCA 1200 
ACTGTACTCT GAGACACTCT GGGACATCTG CAGCTGCTCC GGAGGCAGCT 1250 
GGCGTGTTTG CATTAGCTTT GGAGGCTAAC CTGGATCTGA CCTGGAGAGA 1300 
CATGCAACAT CTGACTGTGC TCACCTCCAA GCGGAACCAG CTTCATGATG 1350 
AGGTTCATCA GTGGCGACGG AATGGGGTTG GCCTGGAATT TAATCACCTC 1400 
TTTGGCTACG GAGTCCTTGA TGCAGGTGCC ATGGTGAAAA TGGCTAAAGA 1450 
CTCGAAAACT GTTCCGGAGA GMTCCATTG TGTGGGAGGC TCTGTGCAGA 1500 
ACCCTGAAAA AATACCACCC ACCGGCAAGC TGGTACTGAC CCTCAAAACA 1550 
AATGCATGTG AGGGGAAGGA AAACTTCGTC CGCTACCTGG AGCACGTCCA 1600 

AGCTGTCATC ACAGTCAACG CGACCAGGAG AGGAGACCTG AACATCAACA 1650 
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FIG. 2C 

TGACCTCCCC AATGGGCACC AAGTCCATTT TGCTGAGCCG GCGTCCCAGA 1700 
GACGACGACT CCAAGGTGGG CTTTGACAAG TGGCCTTTCA TGACCACCCA 1750 
CACCTGGGGG GAGGATGCCC GAGGGACCTG GACCCTGGAG CTGGGGTTTG 1800 
TGGGCAGTGC ACCACAGAAG GGGTTGCTGA AGGAATGGAC CCTGATGCTT 1850 
CACGGCACAC AGAGCGCCCC ATACATCGAT CAGGTGGTGA GGGATTACCA 1900 
GTCTAAGCTG GCCATGTCCA AGAAGCAGGA GCTGGAGGAA GAGCTGGATG 1950 
AGGCTGTGGA GAGAAGCCTG CAAAGTATCC TGAGAAAGAA CTAGGGCCAC 2000 
GCTTCCGAAT TC 2012 
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Phe Cys Val Trp Cys Ala Leu Ser Ser Val Lys Ala Lys Arg Gin 

20 25 30 

Phe Val Asn Glu Trp Ala Ala Glu He Pro Gly Gly Gin Glu Ala 

35 40 45 

Ala Ser Ala He Ala Glu Glu Leu Gly Tyr Asp Leu Leu Gly Gin 

50 55 60 

He Gly Ser Leu Glu Asn His Tyr Leu Phe Lys His Lys Ser His 

65 70 75 

Pro Arg Arg Ser Arg Arg Ser Ala Leu His He Thr Lys Arg Leu 

80 85 90 

Ser Asp Asp Asp Arg Val Thr Trp Ala Glu Gin Gin Tyr Glu Lys 

95 100 105 

Glu Arg Ser Lys Arg Ser Val Gin Lys Asp Ser Ala Leu Asp Leu 

110 115 120 

Phe Asn Asp Pro Met Trp Asn Gin Gin Trp Tyr Leu Gin Asp Thr 

125 130 135 

Arg Met Thr Ala Ala Leu Pro Lys Leu Asp Leu His Val He Pro 

140 145 150 

Val Trp Glu Lys Gly He Thr Gly Lys Gly Val Val He Thr Val 

155 160 165 

Leu Asp Asp Gly Leu Glu Trp Asn His Thr Asp He Tyr Ala Asn 

170 175 180 

Tvr Asp Pro Glu Ala Ser Tyr Asp Phe Asn Asp Asn Asp His Asp 

185 190 ' 195 

Pro Phe Pro Arg Tyr Asp Leu Thr Asn Glu Asn Lys His Gly Thr 

200 205 210 

Arg Cys Ala Gly Glu He Ala Met Gin Ala Asn Asn His Lys Cys 

215 220 225 

Gly Val Gly Val Ala Tyr Asn Ser Lys Val Gly Gly He Arg Met 

230 235 240 
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Leu Asp Gly He Val Thr Asp Ala lie Glu Ala Ser Ser He Gly 

245 250 255 

Phe Asn Pro Gly His Val Asp lie Tyr Ser Ala Ser Trp Gly Pro 

260 265 270 

Asn Asp Asp Gly Lys Thr Val Glu Gly Pro Gly Arg Leu Ala Gin 

275 280 285 

Lys Ala Phe Glu Tyr Gly Val Lys Gin Gly Arg Gin Gly Lys Gly 

290 295 300 

Ser lie Phe Val Trp Ala Ser Gly Asn Gly Gly Arg Gin Gly Asp 

305 310 315 

Asn Cys Asp Cys Asp Gly Tyr Thr Asp Ser lie Tyr Thr lie Ser 



3?0 



325 



He Ser Ser Ala Ser Gin Gin Gly Leu Ser Pro Trp Tyr Ala Glu 

335 340 345 

Lys Cys Ser Ser Thr Leu Ala Thr Ser Tyr Ser Ser Gly Asp Tyr 



350 



355 



Thr 



Asp Gin Arg lie Thr Ser Ala Asp Leu His Asn Asp Cys Thr 



365 



370 



Glu Thr His Thr Gly Thr Ser Ala Ser Ala Pro Leu Ala Ala Gly 

380 385 390 

He Phe Ala Leu Ala Leu Glu Ala Asn Pro Asn Leu Thr Trp Arg 

395 400 405 

... Asp Met Gin His Leu Val Val Trp Thr Ser Glu Tyr Asp Pro Leu 

410 415 

Ala Ser Asn Pro Gly Trp Lys Lys Asn Gly Ala Gly Leu Met Val 

425 430 435 

Asn Ser Arg Phe Gly Phe Gly Leu Leu Asn Ala Lys Ala Leu Val 



440 



445 



Asp Leu Ala Asp Pro Arg Thr Trp Arg Asn Val Pro Glu Lys Lys 



455 



460 



Glu Cys Val Val Lys Asp Asn Asn Phe Glu Pro Arg Ala Leu Lys 



470 



475 
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FIG. 3C 

Ala Asn Gly Glu Val He Val Glu He Pro Thr Arg Ala Cys Glu 

485 490 495 

Gly Gin Glu Asn Ala He Lys Ser Leu Glu His Val Gin Phe Glu 

500 505 510 

Ala Thr He Glu Tyr Ser Arg Arg Gly Asp Leu His Val Thr Leu 

515 520 525 

Thr Ser Ala Val Gly Thr Ser Thr Val Leu Leu Ala Glu Arg Glu 

530 535 540 

Arg Asp Thr Ser Pro Asn Gly Phe Lys Asn Trp Asp Phe Met Ser 

545 550 555 

Val His Thr Trp Gly Glu Asn Pro Val Gly Thr Trp Thr Leu Lys 

560 565 570 

He Thr Asp Met Ser Gly Arg Met Gin Asn Glu Gly Arg He Val 

575 580 585 



Asn Trp Lys Leu He Leu His Gly Thr Ser Ser Gin Pro Glu His 

590 595 600 

Met Lys Gin Pro Arg Val Tyr Thr Ser Tyr Asn Thr Val Gin Asn 

605 610 615 

Asp Arg Arg Gly Val Glu Lys Met Val Asn Val Val Glu Lys Arg 



620 



625 



Pro thr Glh Lys Ser Leu Asn Gly Asn Leu Leu Val Pro Lys Asn 

635 640 645 

t 

Ser Ser Ser Ser Asn Val Glu Gly Arg Arg Asp Glu Gin Val Gin 

650 655 660 

Gly Thr Pro Ser Lys Ala Met Leu Arg Leu Leu Gin Ser Ala Phe 

Jf ?nr\ 675 



665 



670 
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FIG. 3D 



Ser Lys Asn Ala Leu Ser Lys Gin Ser Pro Lys Lys Ser Pro Ser 



680 



685 



Ala Lys Leu Ser lie Pro Tyr Glu Ser Phe Tyr Glu Ala Leu Glu 



695 



700 



Lys 



Leu Asn Lys Pro Ser Lys Leu Glu Gly Ser Glu Asp Ser Leu 



710 



715 



720 



Tyr Ser Asp Tyr Val Asp Val Phe Tyr Asn Thr Lys Pro Tyr Lys 

725 730 

His Arg Asp Asp Arg Leu Leu Gin Ala Leu Met Asp lie Leu Asn 



Glu Glu Asn 
753 
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